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Chapter 1 


Introduction 


1.1 The Laws of Motion 


The three laws that govern motion are usually attributed to Isaac Newton. 
They are: 


1. Every body continues in its state of rest or of uniform motion in a 
straight line unless it is compelled to change that state by forces im- 
pressed upon it. 


2. Rate of change motion is proportional to the impressed force, and is in 
the direction in which the force acts. 


3. To every action there is always an equal an opposite reaction. 


The bodies referred to in the First Law should be taken to be particles. 
Particles are idealizations of objects that have no size. The Laws of Motion 
can, in most cases, be generalized to systems of interacting particles, although 
care must be taken in some specific instances. 

The tendency of a particle to continue in a state of rest or uniform motion 
is called inertia. Mass is the quantitative measure of inertia of a body and it 
is given numerical value by comparison to a standard. An inertial reference 
frame is one in which Newton’s First Law holds strictly. Such a frame is also 
an idealization rather than a physically realizable frame. 

Changes in position are measured in terms of distances and times. Dis- 
tances and times are given quantitative meaning by comparison to standards. 


The concept of motion in the Second Law can be given quantitative mea- 
sure as the momentum of a particle. If m is the mass and v is the rate of 
change of position, then the momentum p is given by p = mv. Newton’s 
Second Law then can be expressed as a vector equation, 


dp dv 
mane amr, 

Conceptually, forces are pushes or pulls. The fundamental forces of clas- 
sical mechanics are either the gravitational force or the electromagnetic force 
or some manifestations (friction, viscosity, contact, etc.) of these which may 
obscure their true origins. Forces are given quantitative meaning through 
Newton’s Second Law which connects force to quantitative measures of mass, 
length and time. 

Newton’s three Laws of Motion are based on experiment and cannot be 
proved or derived. 


1.2. Vectors 


There are two aspects to motion. The first is called kinematics and the 
second is called dynamics. Kinematics is purely the description of motion 
independent of the laws of physics which govern it. The latter study is 
dynamics. We are particularly interested in the dynamics of particles and 
systems, but before we turn to dynamics, we have to develop a mathematical 
language for describing motion. 

We will be concerned with two kinds of quantitites. The first, called scalar 
quantities, are characterized by magnitude only and are represented by or- 
dinary real numbers. Time and temperature are scalar quantities. Other 
quantitites have the combined characteristics of magnitude and direction. 
These are represented by mathematical objects called vectors. Relative po- 
sition, velocity, force and acceleration are vector quantities. 

In three-dimensional space, you can think of a vector as a directed line 
segment having length (magnitude) and direction. You may also think of it 
in terms of an ordered set of three scalar components. In texts, vectors are 
usually represented in bold face. 


a= (a4, a2, a3) 


The essential characteristics of vectors are the following: 
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1.2.1 Equality of Vectors 


Two vectors a and b are equal if and only if their respective components are 
equal, i.e. a= b means that a, = b,, ag = by and a3 = b3. 


1.2.2 Vector Addition 
a+b = (a1, 2, a3) + (b1, be, bs) = (a1 + b1, a2 + be, ag + bs) 
1.2.3. Vector Subtraction 
a—b=a+(—1l)b = (a; — bi, a2 — bo, a3 — b3) 


1.2.4 Null Vector 


There exists a null vector 0 such that a+ 0 =a. 

1.2.5 Commutative Law of Addition 
at+b=b+a 

1.2.6 Associative Law of Addition 


a+(b+c)=(a+b)+c 
In component form this can be written: 
(a1 + (b1 + 1), a2 + (bz + C2), a3 + (b3 + €3)) 


= ((ay + b1) + cr, (Gg + be) + C2, (ag + 53) + 3) 


1.2.7 Multiplication of a Vector by a Scalar 


There are several kinds of multiplication defined for vectors. Let p be a scalar 
and let a be a vector. Then, 


pa= p(ai, a2, a3) = (pay, paz, pas) 


1.2.8 Scalar Multiplication of Vectors; Dot Product 


There are two kinds of multiplication defined for vectors with vectors. The 
first is called the dot product. 


a-b = |a||b] cos6 = S> a;b; = ajbj. 


Here, |a| is the magnitude of a, i.e., 


la] = at + a3 + a3 = Vajaj, 


and @ is the angle between the directions of a and b. Very often, for brevity, 
an index such as 7 in the expression a;a; that is repeated exactly twice will 
imply a summation on that index. In such cases the summation sign > is 
not written and the index 7 is said to be a dummy index because when you 
actually write out the expression, it is replaced by 1s, 2s and 3s. Such is the 
case for 

a;a; = S- a,aj; = aa + as + a3 = j4;. 
The dot product of two vectors is itself a scalar; hence this form of multipli- 
cation is described as “scalar multiplication” of vectors. 

The following are theorems for the dot product: 


a-b=b-:a 


a-(b+c)=a-b+a-c 
p(a- b) = (pa) - b =a- (pb) = (a: b)p 


1.2.9 Vector Multiplication of Vectors; Cross Product 


The second kind of multiplication produces a vector. Again @ is the angle 
between the directions of a and b. We then have, 


c=ax b= |al|b| sin én, 


where n is a vector of unit length and direction perpendicular to the plane 
which contains both a and b. The direction is further defined by a right 
hand rule, i.e., the direction of n is the direction the thumb of our right 
hand would be pointing if you pointed the fingers of your right hand along 
the direction of a and curled them toward the direction of b. In truth, the 


cross product is not a true vector, but rather is a pseudovector. It does have 
magnitude and components like a vector, but its direction is ambiguous and 
has to be defined artificially by a right-hand-rule. Physical quantities, such 
as the magnetic field, that are represented by pseudovectors do not have a 
true direction, but are given one artificially by a right-hand-rule. 

We may also write, 


G= S- Oi jk; OK = OijkAj On 


yk 


where, again, the repeated indices 7 and k indicate a sum and are dummies. 
The symbol 06;;, is a 3 x 3 xX 3 matrix called the Levi-Civita tensor. If i, 7 
and k have the values 1, 2 and 3 respectively or any even permutation of 
(123), then 6;;, = +1. If the values of i, 7, and k respectively are an odd 
permutation of (123) then 6;;, = —1. Otherwise (such as in the case of two 
indices being the same), 6;;, = 0. (Think of the indices as labeled beads on a 
loop of string. An even permutation is one you can get by simply moving the 
beads counterclockwise or clockwise along the loop to change their order as 
they sit in your hand. Thus (312) and (231) are even permutations of (123). 
An odd permutation, on the other hand, is one for which you would have 
to exchange the positions of two adjacent indices. Thus (213) and (132) are 
odd permutations of (123).) While this form takes a little getting used to, 
it can be a powerful way of writing and manipulating vectors. Please note 
that it is the ith component and not the full pseudovector that is given by 
the subscript form. 
We have the following theorems: 


axb=-bxa 
ax (b+c)=axb+axc 


p(a x b) = (pa) x b =a x (pb) = (ax b)p 


1.2.10 Unit Vectors 


We define a special set of vectors that have unit length. Usually (almost 
always) we will choose three vectors that are mutually orthogonal. Thus, 


é) = (1, 0, 0) 


€, = (0, 1,0) 
é; = (0,0, 1) 

However, observe that in this instance the subscripts are used to number 
the three unit vectors rather than specify components of the vectors. With 
this in mind and given that the three unit vectors are ordered by a right- 
hand-rule, i.e., that the direction of 63 is the direction of €; x €9,, we have 


6; nes = On 


where 0;; is a matrix whose values are 1 if 7 = 7 but 0 otherwise. Using the 


convention that a repeated index implies summation on that index, we may 
also write, 


e; xX ej = Oijk@k- 


An arbitrary vector may be written in terms of unit vectors, 


a = (41, d2, 43) = a6, + A2€q + 363. 


1.2.11 Derivatives of Vectors 


The derivative of a vector with respect to a scalar is defined by: 
da _ a(t+ At) — a(t) 
dt roar At 


One may easily show that 


d a. ae 

qh?) usa 

d db da 

qa b)=a ae aE 

d db da 
(are by = aS — b 

go a, 


As long as the direction and length of the unit vectors is unchanging, we 
may write, 


1.2.12 Directional Derivative, dypa 


1. w(a1, 22,23) = c where c is a constant defines a surface in Cartesian 
coordinates. If the constant c is changed by a small amount, a second 
surface near the first is defined. An example of such a surface would 
be the sphere defined by 


e+y +2 =a’. 


In the example, a is the radius of the spherical surface. Changing a 
creates a nested set of spherical surfaces. 


2. Suppose P and Q are two points that are on separate surfaces that are 
close together so that P and Q are close together. The chain rule of 
differentiation gives us an estimate of the change in y~ as one moves 
from P to Q. 

Ow Ow Ow 
d = ——dz, + ——dr. + ——dr3. 
PQ Or, : Ox 0x3 ; 
Using subscript notation and summation convention, this can be writ- 
ten, 


We may also define a vector ds which connects P to Q, 
ds = dx 1e1 + dx. + dx3e3 = dx;e; 


and define the gradient of w as, 


Me ~ Or, : Ox 0x3 


Thus, 
dpa _ Vw - ds. 


3. Observe that if P and Q were in the same surface, dwpg = 0 for a 
ds connecting them. This means that the gradient would have to be 
perpendicular to the surface at each point on the surface. 


1.2.13 Transformation of Vectors 


Consider two sets of orthogonal unit vectors 6), 62,63 and €/, 6,65. The 
second set is rotated in a completely general way relative to the first so that 
there is an angle aj; between €{ and €;, an angle aj between €{, and és, etc. 
We may use either of these two sets of unit vectors to express an arbitrary 
vector a, 
a = a;6; = a. 

We may form a dot product of these expressions with one of the unit vectors 
of the second set to form, 


a! Alia a 
But €; - €; = 6;;. Thus 
a = cos(aji) a4. 


If we define a matrix S of the cosines of the angles between the primed and 
unprimed axes, the elements of the matrix are defined to be S;; = cos(a;;), 
1.€., 

a; = eerie 


Using a similar argument one can also show that 
a; = Sia’, 
pe aga: 


The matrix S' is called the transformation matriz, the elements of the matrix 
Si; are called the transformation coefficients and the set of cos a;; are called 
the direction cosines of the primed axis set relative to the unprimed set. It 
is also easily shown that 

6 = Si58;, 


where, again, the repeated index implies summation. 


1.2.14 Index notation for Cartesian vectors 


The symbol 0;; is called the Kronecker delta. It is a 3x3 matrix whose 
elements are 1 if i = 7 and 0 if 7 # 7. The symbol 06,;, is called the Levi- 
Civita tensor. It is a 3x3x3 matrix whose elements are 1 if (ijk) stand for 
an even permutation of (123), -1 if (ijk) stand for an odd permutation of 
(123), and 0 if any two indices are the same. (123) are labels for the axes of 
a right-handed, Cartesian coordinate system. 
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Unless otherwise indicated, it is assumed that there is a summation on 
repeated indices. 


a-b=aj,b; (dot product) 
cH=axb 5 Cy = O4;405 0p (cross product) 
det(aj;) = 04402503, 0;7k 


det(aj;) = Aj14;20 430; 5% (determinant) 
V-a= Via; (divergence) 
C= Va 5 Ci = Oi V 5G (curl) 

Vw = 6,Viv (gradient) 
V20 = ViVi (Laplacian) 


The rules are: 

1. Repeated indices mean “sum”. The repeated index is a dummy and 
may have any symbol as an index that is otherwise unused in a particular 
term. 


2. No more than two identical indices are allowed in any term. 

3. The operator V; = 0/Ox; operates on everything to its right. 
Example: V,a;b, = aj Vibe + bp Via; 

4. The operator V; operating on a Cartesian coordinate, x; yields 1 if 


i = j, but yields 0 if 1 4 7. 


Wi; = Ops where 
bi; =0 if ij 
oe Wt (Kronecker delta) 


5. Products of Kronecker deltas may be reduced if an index is repeated 
(summation). 
OijOkj = Oik 
but Oag0ag = On =3 (trace of Os) 


6. Relationships among elements of the Levi-Civita tensor. 
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Oijk = —Ojik = —Sikg = —On i (odd permutations) 
Oijk = Onig = Oj: (even permutations) 


7. Products of Levi-Civita tensors may be reduced if an index is repeated 
(summation) with the following pattern: 


OcseOe = jlOkm = OjmOkI 


Example: 

(ax V)-r = 0ijnAjV EXi 
= OijkAj Oki 
= 05g; 
=) 


1.3. Position, Velocity, Acceleration 


The position r of a particle relative to a chosen origin is a vector quantity. 
Like any other vector, it can be written, 


r = jj. 


The magnitude of r is a scalar quantity given by r = ,/x;x7;. The time 
derivative of r is the velocity v, 


dr, ae az de; 
SS == SS FF SS — (x;€;) = VeGj + dq aa 


dt dt dt 


In a coordinate system that is fixed in space, the unit vectors are constant 
in magnitude and in direction. In this case the time derivatives of the unit 
vectors vanish and we may write, 


v= Lj Gj. 
The magnitude of the vector v is the speed v. In a fixed coordinate system, 
Vv=VvV Lj Xj. 
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The time derivative of the velocity is the acceleration, 


dvi... @¥ 


a= —= Vv — 
dt dt? 
In a fixed coordinate system, 


diy. 
a= ae = Lj. 

Sometimes it is convenient to use a set of unit vectors that is not fixed 
in space. While the unit vectors continue to have constant unit length, their 
directions may be changing. As a result the unit vectors have time derivatives 
that do not vanish. There are two circumstances where this commonly occurs. 

First, consider a particle moving along a curved path through space. Let 
7 be a unit vector tangent to the particle’s path. As the particle moves from 
point to point along the curved path, 7 changes direction. At two times 
separated by At, 

T(t + At) = 7(t) + dr. 


You can think of the three vectors in this expression forming a small triangle 
with an angle d@ between 7(t + At) and 7(t). Then, 


V = UT 


and, 
dv dv, dt 
a= Tee rg + Bias 
If At is very small, dO is very small and d7 is (almost) perpendicular to 
7. Let n be a unit vector along d7 in the limit that At — 0. The magnitude 
of dv is (1)d0. Hence, dt = dn. Dividing by dt, we have, 


Gr dO. ig 
ie eens 
and, 
du . . 
Be UU, 


For At small, the particle can be thought of as moving along a path that can 
be approximated as a short segment of a circle with radius p. Then v = p@ 


and, 
a= a arm 
dt p 
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In general the radius of curvature p will be a function of the position of the 
particle along its trajectory. The formula is instantly applicable to particles 
moving in circles for which p is the radius of the circle. We observe that accel- 
eration has a tangential component dv/dt long its tangent and a centripetal 
component v?/p towards the center of the arc of its trajectory. Compare this 
with the form of a = 7#;€; in a fixed coordinate system. Both are correct. 
The two are just different ways of describing the same thing. 

Second, consider a frame of reference F’ with a set of unit vectors é%. 
Assume that the set is rotating in some arbitrary fashion. To characterize 
the rotation, we will invent a pseudovector which we will call the angular 
velocity w. We will imagine the instantaneous rotation to be about some axis 
along which we will put a unit vector X. To give a direction to w we will 
define a positive rotation along N by a right-hand-rule, i.e., the rotation is 
positive if the thumb of your right hand points along A when your fingers 
curl forward in the direction of the rotation. 

As the result of an infinitesimal rotation of the system of unit vectors, 
the unit vector €; changes direction such that 


é)(t + At) = é)(t) + dé}. 

Since At is small, dé) is perpendicular to é) so that we may write, 
dé, = (ds2)€, + (ds3)é 

where ds2 = (1)d03 and ds3 = (1)d09. Dividing by dt, 


dai dO yb 


= 4 é! = 626!, + 684. 
de Gon” Ge Bt 


If we now identify the 6s to be the components of a pseudovector w in the 
primed set, then 


The negative sign in front of w) arises because a right-hand screw twisted in 
the sense of increasing 02 is a negative rotation about €g by the right-hand- 
rule that gives directions to rotations. We can derive similar expressions for 
é, and ée3. All three such relationships are summarized by, 


Now consider two frames F' and F” each with origins O and O’ and sets 
of unit vectors é; and é/ respectively. The unprimed frame is fixed in space 
and its unit vectors are constant. Assume that F” is moving relative to F’ 
with arbitrary translation and rotation. Let r be the position of a particle 
relative to O, r’ be the position of the same particle but with respect to O’. 
Finally, let R be the position of O’ with respect to O. Then 


r=r'+R. 


The vectors r, r’ and R can all be expressed either in terms of e; or ef} with 
different sets of components in the two systems related by a transformation. 
One of the several ways we can write r = r’ + R in terms of unit vectors is, 


A fal A 
He; = @,e, + AGE; 


Be careful to note that we have chosen to express r’ in terms of é€; but have 
expressed r and R in terms of €;. We could also have written r’ = ze; where 
the asterisk is used to indicate components of r’ in the unprimed system, but 
it would not have served our purpose as you will see below. 

If we differentiate with respect to time, 


de’ 
iO; = LE, + x ae + Xe; 
Using, 
wash =wxé 
dt - 
we have, 


vevitwxr'+Vg. 


If we differentiate again with respect to time, 


dv _ dv’ tp dws eqeicauie dr’ rf dV 5 
dt dt dt dt dt 
Using, 
dv 
at 
/ 
~ = “(ie =a’twxv’ 


a tie) =vitwxr’ 
dV 
dt 0 


we then have 
a=al+2Qwxvit+wxr'+wx (wxr’)+ Apo. 


In this form, we have the acceleration expressed as a series of terms. The 
first term on the right is the acceleration of the particle relative to O’, the 
second is called the Coriolus term, the third is the transverse acceleration, 
the fourth is the centripetal acceleration and the fifth is the acceleration of 
O’ relative to O. We will return to this equation later, but derive it here to 
emphasize that when the unit vectors are themselves changing with time, 
care must be taken if motion is to be correctly described in such a system. 


1.4 Problems 


1. Let 


and the matrix A, 


Evaluate explicitly 
a-b= ajb;, 


GQ = dijkaj Dk, 
qG= AG tls; 
C= Ajia;, 

Aii, 
d= a, Aid; 
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2. To gain experience in using the summation convention for Cartesian 


vectors, prove the following: 


a- b= 


b-a 


axb=-bxa 
a x (b x c) = b(a-c) —c(a- b) 
ax (b+c)=axb+axc 
(a x b) x (ex d) =(ax b-d)c—(axb-c)d 
(a x b)- (ec x d) = (a-c)(b- d) — (a-d)(b-c) 


a-bxc= 


a, a2 a3 
by by bs 
Cy CoQ C3 


3. A vector a has components (1,1, 1) in a Cartesian system. What are its 
components in a system obtained by a +60° rotation about the x-axis? 
Note that the sense of positive rotation is given by a right-hand-rule. 


Ie 


Chapter 2 


Generalized Coordinate 
Systems 


2.1 Generalized Coordinate Systems 


Cartesian coordinates are the familiar rectangular coordinates x;. They have 
the dimensions of |length] so that speed has the dimensions of [length] /|time] 
and acceleration has the dimensions of [length]/[time]?. But what if you wish 
to use plane polar coordinates, r and 6? Theta does not have the dimension 
of length. How then should we write down an expression for acceleration in 
terms of polar coordinates? What we attempt below is to deal with so-called 
generalized coordinates so that we see how to write down correct expressions 
for positions, velocities and accelerations. We will try to do it in a general 
way so that we do it once and for all for plane polar coordinates, cylindrical 
coordinates, spherical coordinates, parabolic coordinates and some coordi- 
nate systems you haven’t even made up yet. 

Unless otherwise indicated, it is assumed that there is a summation on 
repeated indices. 


2.1.1 Definitions 


é; (unit vectors) 

a= a,é; (physical components, a;) 

q', dq’ (generalized coordinate, differential) 
ds (differential line element) 
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=" pgs 
a= A'b; 
bi = Vq' 
a= A;b* 
Gig = b; QB b,; 
g? —b’-b’ 
_ Obi 

ri = bP: Det 

— 1/2m(ds/dt)? 
. 
fo eng 


2.1.2 Theorems 


a; = ./GaA' (no sum on (ii)) 


ds” = g;,dq'dq 
b’ . b; = 0% 
959" = OF 

Ai = 9i3 A? 

b; = gijb 

b' = gb 

Ob; __ db; 


re 
i= = g'"' 5 (Ogiu/Oq + Ogji/Oq' — 


Ifa a,8; At b; 


A‘ = (G* + @@Ti) 


A, = 49nd") — 
a = STi 
as = /G9" Aj 


(basis vectors, b;) 
(contravariant components, A’) 
(reciprocal basis vectors, b‘) 
(covariant components, A;) 
(covariant metric tensor, gj; ) 
(contravariant metric tensor, g’’) 
(Christoffel symbols, I) 
(kinetic energy) 

(potential energy) 

(covariant force component, F;) 
(Lagrangian) 


0gi;/0q') (Christoffel symbols) 


A;b’ is acceleration: 


(no sum on (ii)) 
(no sum on (ii)) 


As a practical matter, we use this machinery in the following manner. 
We write down an arbitrary ds from which we form ds? = ds-ds. We 
divide the expression for ds? by dt? to form (ds/dt)? and from this we form 

= (1/2)m(ds/dt)*. We then operate on T according to the prescription, 


Ay = 7 (Gm) a a 
m \ dt dq Oq 
to obtain A;. From the A; we obtain the A‘ and from the A’ we obtain the 
physical components of acceleration, a;. These expressions tell us how to 
write down correctly the acceleration in our particular coordinate system. 
Since F;,, = mA;, we may also obtain the generalized forces if we know the 


acceleration a priori. In systems where a potential energy function, V, exists 
and we can write, 


~ Ogk 
we can dispense with most of the machinery by defining CL = T’ — V and 
operating on L according to the following prescription: 
d eae OL 
dt 0g*’ = Og* 
These latter equations are Lagrange’s Equations and they are the differ- 
ential equations of motion of the system. 
In problems with particle motion in electric and magnetic fields, the La- 
grangian, L, becomes: 


L=T-V-—evy+(e/c)(A-v) 


where e is the charge on the particle, ~ is the electric potential, A is the 
magnetic vector potential and v is the velocity of the particle. 


Example: Plane polar coordinates 


A 


e; =f €) = O 
x1 =rcosé r=4/(2} +23) 
2 =rsind 6 = arctan(x2/21) 


Or /Ox, = 21/\/ (27 + #3) ) = cosé Or /Oxg = X2/\/ (x7 + x3) = sind 
00/Ox, = —X2/(x? +23) =—sinO/r 00/Oxr2 = 21 /(x7 + 23) = cos6/r 
Ox, /Or = cosé Ox, /00 = —rsind 
Ox /Or = sind Ox2/00 = rcosé 
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2.1.3 An Example of Generalized Coordinates 


Coordinates are used to specify the position of a particle in space. In a three- 
dimensional space, we need three such coordinates. The three Cartesian 
coordinates (2 ,%2,23) are an example. Spherical coordinates, cylindrical 
coordinates and plane polar coordinates are alternatives that are sometimes 
used. Consider, for example, spherical coordinates (r,@,@). Observe that 
while the Cartesian coordinates all have dimensions of length, only r in the 
set of spherical coordinates has dimensions of length. Newton’s Second Law 
of Motion can easily be expressed in Cartesian coordinates as 


where the second derivative of x; with respect to time will have the dimen- 
sions of acceleration if Cartesian coordinates are used. But, how should we 
express the Second Law if we use coordinates that may not even have dimen- 
sions of length? In what follows we shall try to solve this problem once and 
for all for all admissible generalized coordinates. The development is some- 
what abstract and general, and it will probably be useful to keep a concrete 
example in mind. 
Consider spherical coordinates, 


r= \/02+034 23 


6 = arctan( 
£3 


o= arctan(—). 
X4 
Think of each of these as an example of the form ~(2, 22,73) = c discussed 
in connection with the directional derivative. If r is a constant, then a 
spherical surface is defined. If 6 is constant, then a conical surface is defined. 
If ¢ is constant, then a half-plane is defined. These surfaces intersect at a 
point whose coordinates are (r, 4, @). Define unit vectors to lie along the lines 
of intersection of these surfaces and pointing in the direction of increasing 
coordinate. For example, r lies along the line of intersection of the cone and 
the plane and points away from the origin, i.e. in the direction of increasing r 
if 0 and ¢ are held constant to define the constant surfaces (cone and plane). 
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Similarly, 6 lies along the intersection of sphere and plane in the direction of 
increasing 6. Finally, @ lies along the intersection of sphere and cone in the 
direction of increasing @. See Fig. 2.1. 

Observe the following: 


18, 6, @ are unit vectors. They have unit length. In general, we will 
denote unit vectors associated with generalized coordinates as 61, 62 
63. 


2. (r,0,@) are an example of generalized coordinates. In general, we will 
denote generalized coordinates as (q', q”, q°). 


3. A small displacement of the particle is denoted ds. If 6 and ¢ are held 
constant in spherical coordinates, a small displacement resulting from 


a change in r would be 
ds(1) = drr. 


Similarly, if r and ¢ are held constant, 
ds(2) = rd0é. 
If r and @ are held constant, 


ds(3) = rsin 6dd@. 


4. A completely arbitrary displacement in spherical coordinates would be 
a vector sum of these three, 


ds = ds(1) + ds(2) + ds(3) = drt + rd00 + r sin Odd. 
If we form the dot product, ds - ds, we obtain 
ds? = dr? + r°dé? + r? sin? 6d”. 


In general, we denote this quantity as ds? = g;;dq'dq’, where a double 
sum is intended. 


5. The velocity of the particle is immediately found in spherical coordi- 


nates to be, 
ds dr. dé » __ do 
V= dt = aye tT ae + renee: 


A 
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Figure 2.1: The intersection of constant surfaces (sphere, cone, and half- 
plane) that define the spherical coordinates of a point. 
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The square of the speed is, 
var+re +r sin? 067. 
In general, this latter quantity will be written, 
= 950 V - 


6. Finally, the kinetic energy of the particle in spherical coordinates may 
be written, 


1 1 2 : 
z= zine = amr +779? +r? sin? 0¢°). 


In general, we will write, 


1 ee, 
f= 9 954 q’. 
The pattern which is illustrated here for finding the kinetic energy will 
turn out to be very important. 


2.1.4 Generalized Coordinates 


Imagine a set of generalized coordinates, q'(21, 22,73). When each is set equal 
to a constant, a set of surfaces are defined for which the point of intersec- 
tion, P, has coordinates Che Gi. q;). For spherical coordinates, the surface of 
constant r is a sphere, the surface of constant @ is a cone, and the surface 
of constant @ is a half-plane. The cone of angle 6, intersects the sphere of 
radius r, in a circle. The half-plane of constant @, intersects the circle at 
a point. The generalized coordinates that uniquely define this point, P, are 
(Fs Os Pp) * 

Now, define a set of unit vectors, é;, which are tangent to the lines of 
intersection of the various surfaces taken two at a time in the same way that 
the unit vectors in spherical coordinates were defined. Thus defined, the unit 
vectors must form a linearly independent set for the coordinates system to 
be “admissible.” The unit vectors are not necessarily orthogonal, although, 
like the unit vectors in spherical coordinates, they often are. Unlike the unit 
vectors in a Cartesian system, the directions of the unit vectors depend on 
the point at which they are defined. For example, in spherical coordinates, 
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the unit vector r has a different direction for different points in space. Its 
direction is given by the tangent line along the intersection of the cone and 
the half-plane, but as @ or @ varies, this tangent line varies in space. 
An arbitrary vector may be expressed as a linear combination of admis- 
sible unit vectors, 
a= a\e1 + Ao + A383 = a;e;. 


The set of numbers, a;, are said to be the physical components of a. The 
set of numbers, g’, are said to be the generalized coordinates of a particle 
in the system, but they do not necessarily have dimensions of length. The 
coordinates must uniquely define the position of the particle. 


2.1.5 Generalized Basis Vectors 


We now introduce a new set of basis vectors that have the same direction 
as the unit vectors, but are not necessarily of unit length nor dimensionless. 
Consider a point of intersection of the level surfaces such that ie and gq: are 
held fixed and only q' is allowed to vary. This defines the line of intersection 
of the 2-surface and the 3-surface which we can describe as a path s(1). A 
tangent vector to this path, b; is obtained by differentiation, 


As(1) 


b, = F 
1 aq! 


The partial derivative indicates that q? and q® are being held constant. The 
faster q' changes in space along s(1), the smaller will be the magnitude of 
b,. Two other generalized basis vectors, by and bs, are defined in a similar 
way. 

In this context, ds = ds(1)+ds(2)+ds(3) represents a differential change, 
dr, to the position of a particle at position, r. Thus, the partial derivatives 
imply that, 


Or _ Os(1) 
dg: dq’ 
Or _ Os(2) 
Ogg?’ 
Or _ Os(3) 
Oq? Oq? 
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Therefore, we may write, 
Or 
Og: 
If these new basis vectors are “admissible,” then we may use them as a 
basis for expressing an arbitrary vector, a, 


a= b, A! + bad" + b3A® = b; A’. 


The coefficients, A’, are called the contravariant components of the vector a. 
They are denoted with a superscript to establish a pattern for the Einstein 
summation convention: A repeated index, once as a subscript and once as a 
superscript, denotes summation. Unless otherwise indicated, this convention 
applies henceforth throughout this chapter. 


2.1.6 Theorem 


Theorem: ds = b,dq’. 
Proof: Let ds = ds(1) + ds(2) + ds(3) be the infinitesimal displacement 
from (q', q?, q°) to (q' + dq', q? + dq’, q° + dq?). By the chain rule, 
Os(7) 


ds = dq’. 
s Dei q 


(Sum on i.) But, by definition, b; = 0s(i)/0q', so ds = b,dq’. QED. 


2.1.7 The Metric Tensor 


The inner (dot) product of two arbitrary vectors, a and c may now be com- 
puted using the generalized basis vectors: 


a-c¢ = (b;A’) - (bjO’) = (b;- b;)A’C’. 


Thus, any inner product is characterized by the nine products, b;-b;. These 
may be grouped together into a 3 x 3 matrix G called the metric tensor with 
components g;; such that, 

Gij = b; . b;. 


Observe that the matrix is symmetric, g;; = g;;. (A tensor is a generalization 
of a vector. Vectors and tensors are defined by their transformation patterns 
when one changes from one coordinate system to another. In our present 
application, the tensor property of the metric tensor is not important.) 
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2.1.8 Theorem 
Theorem: ds* = g;;dq'dq’. . 
Proof: Since we have shown that we may write ds = b,dq', it follows that, 
ds -ds = ds” = (b;- b;)dq'd¢’ = gi;dq'dq’. 
QED. 


As a practical matter, we obtain the metric tensor by inspection of the 
form of ds”. For example, an arbitrary displacement in spherical coordinates 
is written, 


ds = drt + rd00 + (r sin O)o. 
Taking the dot product with itself, we have, 
ds? = dr? + r7d6? + (rsin 0)*d¢”. 
Comparing to the form ds? = g;;dq'dq’, we obtain, by inspection, 
1 0 0 


G=| 0 Pr? 0 
0 0. r?*sin?6 


We are now in a position to establish the relationship between the unit 
vectors and our generalized basis vectors. The length of one of the basis 
vectors is obtained from the inner product of the vector with itself, 


[b;| = bs «bs = Gq 


(No sum on (i7).) Since the unit vectors are already known to have the same 
direction as the basis vectors, we immediately have 


2.1.9 Theorem 


Theorem: a; = \/g@A". (No sum on (ii).) 
Proof: . . . 
Thus, a; = ,/9@A*. QED. 
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2.1.10 Reciprocal Basis Vectors 


Set each of the generalized coordinates, q'(x1, 22,73) = q'(x;) = c, where 
each constant, c, defines a different surface. The gradients of these functions 
of the Cartesian coordinates are vectors that are normal to the respective 
surfaces at the point (q), 47, q3), i-e., 


Vq' =b’. 


The vectors, b’, are called reciprocal basis vectors. Two sets of basis vectors, 
b; and b’, are “reciprocal” if they satisfy the relationship, b; - b’ = 67. This 
relationship says that b! is orthogonal to both by and bg. It also says that 
if b! has a large magnitude, then its reciprocal, b;, has a small magnitude. 
However, we must show that the reciprocal relationship is, indeed, satisfied. 


2.1.11 Theorem 


Theorem: b; - b’ = 6/7. 
Proof: The proof is a simple application of the chain rule. The generalized 

coordinates are independent of one another. Hence, 

_ oO j _ Oxt Og! _ OG _ gy 


hee vee 
Oq' Og Ox" — Oa" 


QED. 


An arbitrary vector, a, can be written in terms of reciprocal basis vectors 
just as well as it can be written in terms of unit vectors, 


a= Ab! + A»,b? + Asb® = A;b’. 


The set of numbers, A;, are called the covariant components of a. The recipro- 
cal basis vectors do not necessarily have unit length nor are they necessarily 
parallel to the unit vectors, but they must be linearly independent if the 
coordinate system is to be admissible. The covariant components of a are 
not the same as the physical components of a nor are they the same as the 
contravariant components. 
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2.1.12 Theorem 


Theorem: A‘ = b’- a. 
Proof: We use the relationship, b' - b; = 0%. 


b’-a=b’- (A’b;) = (b' . b;)A! = 6; A = A’ 
QED. In similar fashion, we can also show A; = b;- a. 


Define g’? = b’- b’. By doing so, we maintain a parallelism between the 
reciprocal basis vectors and the basis vectors. The elements, g’’, form the 
matrix G~! and are called the contravariant components of the metric tensor 
to distinguish them from the covariant components of the metric tensor, 9j;. 
Observe that G~!, like G, is symmetric, i.e. g’? = g?’. 

The covariant and contravariant forms of the metric tensor are useful in 
relating covariant and contravariant components of vectors. Note that the 
pattern of notation is consistent: an index repeated once as a subscript and 
once as a superscript denotes summation. 


2.1.13 Theorem 


Theorem: A; = giz A! : 
Proof: 
Aaya he (b; A’) = (b;- b,;) A? = 95h". 


QED. This important operation is called “lowering the index.” 


2.1.14 Theorem 


Theorem: A’ = g‘/ Aj. 
A’ =b'-a=b'- (b’A;) = (b’- b’) A; = 9” Aj. 


QED. This important operation is called “raising the index.” 
The covariant and contravariant forms of the metric tensor are inverses 


of each other, 
(Cm Clee 


or, 
jk _ sk 
GigG = 67. 
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If you apply the combination G~'G' in succession to an arbitrary vector, G 
will first “lower the index” and G~' will turn around and “raise the index,” 
returning you to where you began, so that the combination of the two, one 
after the other, acts exactly as the identity operation. In practice, this prop- 
erty is used to find the contravariant components of the metric tensor after 
the covariant components are extracted from the form, ds? = g;;dq'dq’. 


2.1.15 Theorem 


Theorem: b; = gigb”. 
Proof: 


1. In an admissible generalized coordinate system, the reciprocal basis 
vectors must form a linearly independent set of vectors. Thus, the set 
of basis vectors, b;, can be expressed as a linear combination of the 


reciprocal basis vectors, 
b; = aj;b’. 


2. We have previously established that, 
Jik = Gri = by + b; 


SO, 


gik = by - (aigb’) = aij(by, - b’) 


_ aij04 = ian: 
3. We conclude, . 
b; = gijb’. 
QED. 
2.1.16 Theorem 
Theorem: b’ = gbj. 
Proof: 
1. We have established that, 
gig"? = OF 
b= gigb’. 
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2. Hence, multiplying both sides of the latter by g* (and forming a sum 
indicated by the repeated index, 7), we have, 


gb; = g™(gijb’) = 6;b’ = b*. 
3. We conclude, 


QED. 


2.1.17 Theorem 


Theorem: Ob;/Oq! = Ob; /Oq'. 
Proof: This symmetry of pattern merely reflects the fact that the order 
in which partial derivatives are taken does not matter: 


db; 0 & 08 dr _ db; 


Oe) gh Og ~ Ba Oeh ~ Oa 
QED. 


Since each of these nine partial derivatives, Ob;/Oq’ are themselves vec- 
tors, we can express each as some linear combination of either the unit vec- 
tors, reciprocal basis vectors or basis vectors. We choose to express them as 
a linear combination of basis vectors, 


Ob; 
Oqi 


= Ts by. 


The numbers which form the coefficients of the sum generated by the dummy 
index k are called Christoffel symbols of the first kind. 


2.1.18 Theorem 


Theorem: 


r= gee Ogix , 291k _ 29s 
w 2 \ Oqi Og = Oq* 


Proot: 


dl 


1. We have, by definition, 


b; - by = giz 
b,; -b, = Gik 
b; - bj = gij. 


2. Differentiate each of these expressions in turn to form, 


OGik Ob, Ob; 

i nee cater 
Oq) Oq Oq) bi 
ee rec aee 
Og’ J Og Og’ by 

=bse— + -b; 
Og* Ogk = dqk 7 


and form the combination, 


Ogik O9Gjk - Ogi; 
Og Og’ gk 


3. We have proved as an earlier theorem that 


au eee ae 

dq — Ogk 
and, 

Ob, 2 ORs 

Ogi Og* 


Thus, when we form the combination 


Ogix , O9ix = 09% 
Og) Oq' Og’ 


four terms add out and two combine. We are left with the result, 


OGik OGjk - O9i; = Ob; ; 
Og = Og' —s Og ag 
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4. We may now use our assumption that 


Ob; 
Oqi 


Then, 


ODik OGjk _ O9i; 
Ogi Og’ ~—Ogk 


5. It is tempting to solve for i, by dividing both sides of this expression 
by 2gi,, but one must remember that the repeated index I indicates a 
sum of terms, not an isolated term. In fact, what we are dealing with 
here are twenty-seven equations, one for each of the combinations of 2, 
j, and k! The correct way to proceed is to multiply both sides by g”*, 
thus introducing an additional sum indicated by the dummy index k. 
We then use a result that we have already proved, 


Go =O". 


Thus, 
2G Gee 2671 = Ore. 


6. We conclude, 


pm a gmt Ogiz  O9jx _ 09% 
a 2\0q  Oq' Odq* 


QED. 


In principle, once one knows the covariant and contravariant forms of 
the metric tensor, the Christoffel symbols can be calculated. In three 
dimensional space there are twenty-seven of them. You will be relieved 
to know that for our purposes this is a formal result essential to the 
derivation of Lagrange’s equations that follow, but not one that we will 
use as a practical tool. 
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2.2 Application to Acceleration 


The mathematics we have introduced can now be extended in several 
directions. It was used by Albert Einstein as the original language 
of his General Theory of Relativity and a simple modification of it is 
also the basis of the most elegant formalism of the Special Theory of 
Relativity (see Chapter 10). The mathematical formalism can also be 
used to give general expressions for the divergence, curl and Laplacian 
in generalized coordinates. However, our immediate purpose is to apply 
the formalism to a particular vector quantity, the acceleration of a 
particle, and thus to reformulate Newton’s Second Law of Motion, F = 
ma, into a new form. The advantage of this new form is that it is 
done in terms of generalized coordinates, so that the reader gets to 
choose the most convenient coordinates for a problem without undue 
concern about how to write down acceleration properly in terms of 
the coordinates that have been chosen. Newton’s Second Law is a 
vector equation which, when resolved into components, becomes a set 
of ordinary differential equations. The equations which result from the 
reformulation of Newton’s Second Law are called Lagrange’s Equations, 
but they are completely equivalent ordinary differential equations which 
describe the motion of a particle. Our immediate purpose is to show the 
connection between Newton’s Second Law and Lagrange’s equations. 
We shall first find the contravariant components of acceleration, then 
“lower the index” to find the covariant components. Finally, we will 
write the covariant form of Newton’s Second Law and show that for a 
large class of problems, the covariant components of Newton’s Second 
Law are Lagrange’s equations. 


2.2.1 Contravariant Components of Acceleration 


(a) We begin with the expression for an arbitrary displacement: ds = 

(b) We divide both sides by dt. Observe that this is a division and not 
the process of differentiation, but the outcome is to turn ds into 
a velocity, 


(c) 


To obtain acceleration, we must differentiate v with respect to 
time, but we must note that as the particle moves, the basis vec- 
tors change with its position and must therefore be considered 
functions of time. Hence, 


dv, _ db; 


a=—_ = *b; : 
dt qb; +¢q 


To compute db;/dt, we can apply the chain rule, 


di og dt 7 agi 


From this, we conclude, 


a= Pbit PP ay = Fb: + FET ibs, 
or, _ 
a= [g* + @'@Ti]by = A* by. 


From this expression, we identify the contravariant components of 
acceleration as, 
kk | sinjpk 
AN = P+ GGT 


2.2.2 Covariant Components of Acceleration 


1. To obtain the covariant components of acceleration, we lower the index 
using A) = gue A*. 


Ar = 9% A* = ging* + ce 


2. We may use the expression which we have just derived for the Christof- 
fel symbols to write, 


1 
onl, = song" 


5 n (fe a OGin - a 


Ogi Og’ Oq” 


Observe, 


gg” = OF, 
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so that we can write, 


ony = 5ot (See + See - 38 


OP NOG! Og = Oq” 


Og Ogi Od! 


_ . (34 t Ogit =| . 


3. Thus, 


1 Ogu, Agxt Ogi; eg 
Og Og Od! , 


1= 9g + 5 


In principle, the covariant components of the acceleration can be ob- 
tained from this expression once the covariant form of the metric tensor 
is known. Again, this is a formal result that is essential to our deriva- 
tion, but not one that we will use as a practical tool. 


2.2.3 A Better Form for the Covariant Components of 
Acceleration 


We now begin to change the form of the covariant components of acceleration 
to put them into a form that is a practical tool. Consider the first term in 
the expression we have just derived for A;, namely, 


ged 
Observe that this term arises in the expression, 


FH ed = gg" + ag! qd; 


where we have used the chain rule in the second term on the right to obtain 


ay )= PGK om 
dt Tk agn 4 ; 


A renaming of dummy indices (in the second term below) immediately yields, 


d : - 
7 gine’) = gung° +— 


We may now write, 
A= — yh) Ee yb Ad 
= |, (Gu9") re 
1 Ogit fe Ogi Ogi \ «i “J 
2\d0q@ dq aq! 
Observe that by simply switching dummy indices and using the fact that 
gil = 9u, We can show that 


PG =e. 


O95 «i V= 
Og’ Oqi Og’ 


These equalities may be used to eliminate the middle three terms and to 
reduce our expression for A; to 


2.2.4 Newton’s Second Law in Covariant Form 


Finally, we arrive at the point of our entire excursion into differential geom- 
etry! The covariant components of Newton’s Second Law are Fy = mAg. 
In some texts, the covariant components of force, Fy, are called generalized 
forces. In terms of physical components, of course, Newton’s Second Law is 
the familiar f, = mag. 


1. Observe that ae 
ds? = gijdq' dq’ 


and that, therefore, we may write the kinetic energy, T, of a particle 


as, 
1 ods 1 — 

T = —m(—)? = <mg,,4'@. 

5G) 5 GT 


2. Observe further that if we take the view of generalized coordinates and 
generalized velocities as independent variables for purposes of taking 
partial derivatives, 

OT 1 Ogi; “ij 


age 2 Ogh 


Og* 
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and, 

ad OL d e d . 

dt age S me (gind )= ma, (Ged ) 
(The factor of one-half disappears in the latter two expressions because 
there is a double sum in the expression for T. If you have trouble seeing 


this, you should write out a short double sum so that you see it works.) 


3. We may therefore write, 


O94 saa5 =F (OT) OF 


dn 3 1 
Fi = mA, = m—(gndg") — (aa 
q 


—m — 
dt 2 Od! dt 
Newton’s Second Law, written in covariant form is, 


_ der) ar 
— dt dg’ Aq" 
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This is a perfectly general result. The primary difficulty with it is 
knowing how to write down F}. 


2.2.5 Lagrange’s Equations 


If we narrow ourselves to a class of problems that satisfies two additional 
conditions we arrive at a remarkable result. If the system we are describing 
is “conservative” so that we may define a potential energy function, V, such 
that 


and V depends only on the generalized coordinates and not on the generalized 
velocities, g’, then we may simplify the covariant form of Newton’s Second 


Law. We may write, 
d_ OT OT OV 


wag? aq Og 
d oT —V) _ oT —V) 
dt  q! Od! 
(Since V does not depend on the generalized velocities, it may formally be 
included in the first term without making any difference.) We now define the 
so-called “Lagrangian”, L, 


= 0. 


L=T-V 


38 


and write 
dol OL 


deg?) — Bg 
These are Lagrange’s equations and they are very important in theoretical 
mechanics. 


1. Lagrange’s equations are just the components of Newton’s Second Law 
for conservative systems written in a different and probably unfamiliar 
form. 


2. The equations that are obtained by operating on the Lagrangian ac- 
cording to the prescription of Lagrange’s equations are ordinary differ- 
ential equations. They are the “equations of motion” of the system. 
Solutions of the equations are models of the motion of the system. 


3. Much of the intervening machinery used to derive the equations disap- 
pears and does not return. For example, we do not directly use (for our 
purposes) b;, b’, I¥,, or g’. These were introduced in order to derive 
Lagrange’s equations and to show their connection to Newton’s Second 
Law. 


4. T and V are usually easy to write down correctly and once written down 
lead to the differential equations of the system in a very methodical way. 


2.3 Problems 
1. Do the following: 


e Draw a diagram (in two dimensions) of a particle whose position is 
specified by plane polar coordinates (r,0). Show the unit vectors 
f and @ on your diagram and clearly label them. 


e Express an arbitrary differential displacement ds in terms of these 
coordinates and show this displacement on your diagram. Find 
ds? = ds- ds in terms of dr and d@ and extract from ds? the 
metric tensor g;;. Display the metric tensor as a 2 x 2 matrix. 
Using ds”, write down the kinetic energy of the particle in terms 
of r and @ and their time derivatives. 


39 


e Determine g” and display as a 2 x 2 matrix. 


e Determine the covariant, contravariant and physical components 
of the acceleration of the particle. Show how these might appear 
on a diagram. (The diagram is only to be qualitative, i.e. it need 
only indicate directions of the vectors and whether the vectors 
have unit length or not.) 


e Let a particle move on a circle of constant radius b. What is the ra- 
dial component of acceleration (physical)? What is the transverse 
component? 


e A bug crawls outward with constant speed vp along the spoke of a 
wheel which is rotating with constant angular speed w. Find the 
radial and transverse components of the physical acceleration as 
functions of time. Assume r = 0 at t = 0. (Ans: a, = —votw?, 
ag = 2upw) 


2. Cylindrical coordinates are defined by x = rcosé, y = rsin@, z = z. 
Calculate the metric tensor, physical velocity and physical acceleration 
in cylindrical coordinates. 


3. Do the following: 
e Beginning with Newton’s Second Law, 


dO oT 
)~ age ~ Fe 


show that if Fi, can be expressed in the form, 


OV 
— Ue? 8 
k age I 92), 


then, 
d am _ OL 
dt Og*’ gk 
e For the one-dimensional harmonic oscillator, we can write F, = 
—kxz. Show that V = ska. Define the function L = T — V and 


obtain the differential equation of the simple harmonic oscillator, 
mi + kx = 0. 
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. Obtain the differential equations of motion for a tether ball. (Spherical 
coordinates. Observe that the tether line has constant length. The 
system has only two degrees of freedom and there will only be two 
differential equations of motion. The length of the tether appears as a 
constant in the Lagrangian.) 


. Obtain the differential equations of motion for a projectile on a flat 
earth. (Cartesian coordinates) 


. Obtain the differential equations of motion for the earth’s motion about 
the sun. (Plane polar coordinates) 


. Obtain the differential equations of motion for a particle moving with- 
out friction on the inside of a cone. The cone is parameterized in 
cylindrical coordinates as z = ar. Write the equations in terms of r, 7 
and 7 by eliminating z. 


. Obtain the differential equations of motion for a particle sliding without 
friction from the top of a hemispherical dome. (Plane polar coordinates. 
The particle does not separate from the dome. The system has only one 
degree of freedom since the radius of the motion appears as a constant 
in the Lagrangian.) 


. Consider an inclined plane consisting of a right-triangular shaped block 
that is free to slide on its bottom along a frictionless horizontal surface. 
If the incline slopes toward the right, the left-hand edge of the block 
is vertical. Define a coordinate X that keeps track of the position of 
this vertical side. Now, put another block on the inclined plane so 
that it slides down the plane (without friction) under the influence of 
gravity. Keep track of the second block’s position with a coordinate x 
measured from the top of the inclined plane. Obtain the differential 
equations of the motions of both blocks and solve them directly for 
the accelerations (%,X) of each. (Unusual, nonorthogal coordinates. 
This problem introduces a problem not found in the previous ones. 
Here there are two masses. In such a case, the Lagrangian is written 
simply as the sum of the the two independent kinetic energies of the 
two objects minus the sum of the two independent potential energies. 
The system has two degrees of freedom.) 
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Chapter 3 


Differential Equations 


3.1 Solutions to differential equations 


Newton’s Second Law of motion in one dimension, F = mz, is a second 
order differential equation. In the simplest cases or in the lowest order of 
approximation, this equation will sometimes reduce to a linear equation. 
Linear equations are the most studied of differential equations and have the 
most systematic methods for solution. Hence, textbook examples typically 
center on problems that can be reduced to linear equations. 

In those problems where the force can be derived as the derivative of a 
potential function, fF, = —dV/dx, the potential may often be such that it 
can be expanded in a Taylor’s series, usually about some minimum point. If 
the series converges rapidly enough, it can be truncated after one or a few 
terms. If the series takes the form: 


V(x) = Vot (dV/dx)2o(2 — 0) + (1/2)(a?V/ dx") 20(% — 20)? +-:- 
then, we get force terms of the type 
F = ky(x — 29) + ko(@ — 29)? +°°> 


which, in turn, lead to a non-linear differential equation of the form 


#+taxr+ Bx? +727 +---=0. 


If all but the linear term (az) are discarded, one has the equation for the 
simple harmonic oscillator. Thus, the simple harmonic oscillator sometimes 
becomes the simplest approximation to a more general problem. 
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In two- and three-dimensional problems, constants of the motion, such 
as the total energy or a component of angular momentum, may be used 
to reduce a multiple-dimensioned problem to a one-dimensional differential 
equation of the type derived above. For example, in the case of periodic 
motion under a central force, one can convert from a time-dependent differ- 
ential equation (7, 7,6, @) to an orbit equation relating r and 6 (in plane polar 
coordinates). We use the substitutions: 


5 eal es 
& = (dx/d0)6 = —(1/r?)r 
and use the constant of the motion, 
= mr’. 


We then have, 
r = —(€/m)(dx/d6) 


and 

= —(€?2?/m?)(d?x/d6") 
The resulting equation may be a one-dimensional equation that can be solved 
using the methods described here. 


3.2 Equations that may be either linear or 
non-linear 


3.2.1 Equation: F(x) = mi where F is only a function 
of position and may be derived from a potential 
function, F(x) = —dV/dz. 


Method of solution: Direct integration, separation of variables 
Solution: Use the chain rule to write ¢ = di/dt = (d&/dx)(dx/dt) = 
(du/dx)v = 1/2d(v?)/dx. Then, 


F(x) = d(1/2mv?)/dx = —dV/dz. 
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Both sides are perfect differentials, so T = —V + constant. This is energy 
conservation, E = T+ V. From energy conservation, it may be possible to 
solve for v = dx/dt = f(x) or invert to solve for x = g(v). 

If one can write dx/dt = f(x), then, 


(0) = [/f(a))ax, 
In conservative systems, this will take the form, 


t = (m/2)/? / (E — V(2))~ de. 


3.2.2 Equation: F(t) = mi where F is only a function 
of time. 


Method of solution: Direct integration, separation of variables 
Solution: 
F(t) = m(dv/dt) 


u(t) = (1/m) [FWat 


3.2.3 Equation: F(v) = mi where F is only a function 
of velocity. 


Method of solution: Direct integration, separation of variables 
Solution: Alternative 1: 


= m f (1/F(v))dv 
We may then possibly be able to solve for v(t) and integrate again: x(t) = 


J v(t)dt 
Alternative 2: 


& = (dv/dt) = (dv/dx)(dx/dt) = v(dv/dz) 


2(v) =m if (v/F(v))du 
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We may the possibly be able to solve for u(x) = dx/dt. Then 


t(x) = [C/o@)de 


We may then possibly be able to invert the expression for t(x) to solve for 


elt) 


3.3 Linear Differential Equations 


The most general linear first-order differential equation is: 


3.3.1 Equation: ay + 9(x)y = h(z). 
Method of solution: Integrating factor. 
Solution: Observe that 
aa 
dx 


Thus, multiply the given differential equation by the factor (called the inte- 
grating factor), 


/ i d / / 
(yeS 94") — (F + yg(a))el 90" 


ad ge ae 


which turns the left-hand-side of the equation into a perfect differential that 
can be integrated immediately. Then, 


Falyel ae) = nae! ame, 


Integrating both sides, we have, 


yes 9@/ae! = [roel Oe Fee On 


where C' is the constant of integration. From this expression, one may solve 
for y(x) if g(x) can be integrated. Formally, 


gn) = eC) h(x)et 94" dy +C). 


However, one should not remember the formula itself as a solution. Rather, 
an equation of this type should be solved by actually carrying out the steps 
just illustrated, 
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1. Compute the integrating factor. 
2. Multiply the differential equation by the integrating factor. 


3. Integrate both sides of the equation, remembering that the left-hand- 
side is a perfect differential. 


4. Solve the integrated equation for y(z). 


The first-order equation may occur in mechanics in the following way. 
If a frictional force proportional to the square of velocity exists, Newton’s 
Second Law may take the form, 


Observe that 


Thus, 


If we now define y = x”, we have, 


dy , g(a), _ 2h(x) 


dx m m 


which is of the form of interest. The solution yields velocity as a function of 
position, but only for problems in which the frictional force is proportion to 
the square of speed. 

The most general lznear second-order differential equation is: 


E+ f(t)t + g(t)x = hit) 


3.3.2. Equation: mi = constant (freely falling particle) 


Method of solution: Direct integration 
Solution: If the constant is -mg (freely falling particle) 


f= £9 + tot — (1/2)a¢? 
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3.3.3 Equation: mz + kx =0 (harmonic oscillator) 


Method of solution: Substitution of the form Ae”™ 


Solution: 
xX = c, cos(wot) + co sin(wot) = Acos(wot — a) 
where wo = \/(k/m) and c and cp or A and a are determined from initial 
conditions. 
3.3.4 Equation: mz + bi = —mg = constant (particle 


falling in a resistive medium) 


Method of solution: Integrating factor 
Solution: 
& = (mg/b)[e7/™" (1 + bitg/mg) — 1] 


x = £9 + (mg/b)[(m/b)(1 — e/") (1 + beo/mg) — t] 


3.3.5 Equation: mi+bi+kx = 0 (particle oscillating in 
resistive medium) 


Method of solution: Substitution of general form, Ae™ 
Solution: The general solution is: 


x = cye™ + c,e%?! 


where 


ay = —(b/2m) + 1/(b/2m)? —k/m 


ag = —(b/2m) — 1/(b/2m)2 — k/m 


In those cases where a; and a2 are complex numbers, the solution is 
oscillatory, but damped. It may then be written, 


g = e OPM) A cos(wit — a) 


wy = /k/my/ (1 — b?/4mk) 


where 


AT 


3.3.6 Equation: mi + kx = Focos(wt) (driven harmonic 
oscillator) 


Method of solution: Substitute a trial solution of the form A cos(wt) 

Solution: The solution is the sum of a particular solution plus the solution 
to the homogeneous equation (right hand side set to 0 , see above). The 
solution to the homogeneous equation is called the transient solution. The 
transient solution will contain two constants that are determined by the 
initial conditions. 

The particular solution is of the form: Acos(wt). Direct substitution 
leads to the following: 


& = —[(Fow) /m(w? — w)] sin(wt) 
1 = [(Fo) /m(w2 — w2)] cos(wt) 


where we = k/m. 


3.3.7 Equation: mi + bi + kx = Focos(wt) (damped, 
driven harmonic oscillator) 


(Also, mi + by + ky = Fosin(wt)). 

Method of solution: These two equations can be combined and solved 
simultaneously by defining the complex number z = «+ zy. By multiplying 
the second of the two by i = V—I and adding, we form the complex equation, 


mi+bs+kz = Foe 


After solving for z we can recover the solutions for x and y as the real and 
imaginary parts of z. 

The solution is the sum of a particular solution plus the solution to the 
homogeneous equation (right-hand side set to 0, see above ). The solution 
to the homogeneous equation is called the transient solution. The transient 
solution will contain two constants that are determined by initial conditions. 

Solution: The particular solution is of the form: 


z= Beiv?) 


where, 


b 
tand = Fai 


and, we = k/m: 


3.3.8 Equation: m%+ bt +kx = ay + ¥ Fi, cos(nwt — a,) 
(damped, periodically driven harmonic oscilla- 
tor) 


Method of solution: The right-hand side of the equation is a Fourier series 
representation of a periodic driving force. The constants ao, Fy, and ay, 
are assumed known from the Fourier series. A complex number z = x + iy 
is defined and the equation above is replaced with an equation in z. The 
solution for x will be the real part of the solution for z. The equation becomes: 


Mz + bz + kz = ao + S- Pierre en) 


The solution is the sum of a particular solution plus the solution to the 
homogeneous equation (right-hand side set to 0, see above). The solution 
to the homogeneous equation is called the transient solution. The transient 
solution will contain two constants that are determined by initial conditions. 

We proceed to obtain the particular solution by substituting a trial solu- 
tion of the form, 

z= Bot >) Belmrw 


Solution: 
r= (ao/k) + 5° B,, cos(nwt — Yn) 
where, 
Fi, 
ee ee 
m? (we — n2w?)? + w?n?b? 
and, 
bnw 
t f= Soe 
a m(we — n?w?) 
and, 
Yn = An + oe 


and we = k/m. 
The solution is simply a sum of the solutions that would result if each 
term in the Fourier series expansion of the driving force were present alone! 
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This important superposition principle is valid for systems which obey linear 
differential equations. The superposition principle breaks down for non-linear 
differential equations. 


3.3.9 Equation: gj;¢ + kijq’ = 0 (multi-particle, har- 
monic oscillation) 


Method of solution: If the kinetic energy and potential energy in a conser- 
vative, multi-particle system can be expressed as: 


Laas 
l= 9 isd q’ 


I aes 
Ve a hisd'? 


then the above differential equations follow. The equations can also be writ- 
ten in matrix form, 


GQ+KQ=0 


A trial solution of the form Q = X cos(wt — a) can be substituted. Q and X 
are column vectors. Solutions of the equation, 


det(K —w®G) =0 


(the secular equation) yields the eigenfrequencies, w;. When these are sub- 
stituted one by one into the matrix equation, 


(K —u’*G)X =0 


one obtains the individual eigenvectors, X;. 
Solution: 


Q = a,X, cos(wit — ay) +--+ + a¢X 7 cos(w et — af) 


where a;, a; are determined by initial conditions. 
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3.4 Nonlinear Differential Equations 
3.4.1 Equation: %+ar+ yx? = 0 
Method of solution: If, 
1. force on the particle is derivable from a potential, V(x) 
2. V(x) is symmetric about the Taylor’s series point of expansion, 
3. the motion is periodic, 
AC = lar t= 0 


then, the lowest-order, non-linear approximation is % + ax + yx? = 0. 

Since we assume that the solution will be periodic in time, we use sym- 
metry to eliminate terms in the solution of the type sin(wt) and assume a 
solution of the form, 


a(t) = bj cos(wt) + bs cos(3wt) +++: 


We assume that 63 is smaller than 6; and discard any terms of higher or- 

der. We substitute the form into the differential equation and gather the 

coefficients of cos(wt) and cos(3wt) and separately set them to zero. 
Solution: 


x(t) = b; cos(wt — b) + (7b3/32a) cos(3wt — 3¢) 


w? wat (3/4)yb? 


b; and ¢ are determined from initial conditions. 


3.4.2 Equation: %+ av — Gx? + yx? =0 


Method of solution: 
If, 


1. force on the particle is derivable from a potential, V(x), 
2. V(x) is asymmetric about the Taylor’s series point of expansion, 


3. the motion is periodic, 
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A Pa ate =, 
Be Ge: 


then, the third-order, non-linear approximation is #+ ax — Bx? + yx? = 0 
Since we assume that the solution will be periodic in time, we use sym- 
metry to eliminate terms of the type sin(nwt) and assume a solution of the 
form, 
x(t) = bp + b; cos(wt) + by cos(2wt) + b3 cos(3wt) +... 


We have assumed that the oscillation assumes its maximum amplitude at 
t = 0. We assume that b; > bo > bg and discard any terms of higher order. 
We substitute the form into the differential equation, gather the coefficients 
of cos(wt), cos(2wt), cos(3wt) and separately set them to zero. 

Solution: 


x(t) & (Gb? /2a) + b; cos(wt) — (8b? /6a) cos(Qwt) + ((7b?/32a) 
+ (37b? /48a*)) cos(3wt) 
where, 
w & a + (3/4) 7b} — (5870; /6q) 
3.4.3 Equation: ¢+ar+yx? = 0 


Method of solution: 
If, 


1. force is derivable from a potential, V(x), 

2. and if, the motion is periodic, 

3. and if, the potential is symmetric, let x = Ay(at) 

4. or if, the potential is asymmetric, let x = B+ Ay(at) 


Solution: The resulting equation may possibly be put into one of the 
forms for which Jacobian elliptic functions are known solutions. 
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13. 
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"4 y(1 — 2k) + 2k2y? = 0 

= cnu, y’ = (—snu)(dnu) 

=a Tsk) 

” + y(k? — 2) + 2y? = 0 

= dnu, y’ = —k?(snu)(cnu) 

Yelle jy?) 
k2) — 2y3(1 — k2) =0 


) a 
two 

—_ t+ 
— || 6 
S 

“~~~ 


a ,y’ = —(cnu)(dnu) /(sn?u) 
a a Ry + k?] 

ay Dk = 0 
cnu, y! = ay aia 
4/ (1 + y?)? ae 
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reoeSY 
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> 

be 
Onna 


I 
KS 


y’ —2 2 + dy + 1) — 6k7y? = 0 
y = sn2u 
(y’)? = 4y(1 — y)(1 — k? + ky) 
yi!’ — 2(1 — k2) — 4y(2k? — 1) + 62y2 = 0 
y = cn7u 
(y’’ = 4y(1—y)(y—1+k?) 
y” - 2(K? — 1) —4y(2 — k?) + 6y? = 0 
y = dn*u 
(y')? = 4y(y — 1)(y — k?) 
— 2k? + 4y(k? + 1) —6y? =0 
y = 1/(sn*u) 
(y’)? = 4y(y — 1)[(L — k?)y + k?] 
y” + 2k? — 4y(2k? — 1) — 6y?(1 — k?) =0 


(y')? = 4y[(1 + y)? — 4k7y] 
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y” — 2 — 8y(1 — 2k?) — 6y? = 0 
y = (dn?u)(tn?u) 


14, (y')? = 4y(1+y)[1 + (1 — ke? )y] 
y” —2—4(2—k?)y — 6(1 — k*)y? =0 
y = tn7u 


3.4.4 Equation: # + ur = f(z, 2) 


Method of solution: (Averaging Method) If the function, f, can be considered 
a small perturbation on otherwise periodic motion, then we can assume a 
solution of the form: 


x(t) = A(t) cos(wot + o(t)) 


Assuming that A and @ vary only slowly with time, they may be considered 
constant over one cycle of the motion. When this assumption is made, we 
can write differential equations for A and ¢ as follows: 


A = —(1/wo)(f (x, £) sin(wot + ¢)) 


§ = —(1/wo)([f(a, é) cos(wot + $)]/A) 


In performing the integrations, the form for x(t) is substituted, but A and ¢ 
are considered constant in the time average over one cycle of the motion. The 
resulting differential equations for A and ¢@ are then solved and substituted 
into the original assumed form to yield the solution. 


3.5 Numerical Solutions to Differential Equa- 
tions 


Consider the second-order differential equation, 
Cf ast) 
We can define a new variable, z(t), such that 
P= dea = 2b) 
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Then, our original equation can be reduced to solving two first-order differ- 
ential equations: 


dZta = J (e250) 
aivde= Z 


Note that the right-hand sides are free of explicit “dotted” variables. 
This scheme can be generalized for multi-dimensional systems: 


The underlying idea for solving the initial value problem is always this: 
Rewrite the dy’s and dt as Ay; and At. Then, 


Ay; = Fl Vigne LAE 


1.€. 


In the limit of taking At to be small (but not too small!), a good approxi- 
mation to the underlying differential equation is achieved. Literal implemen- 
tation of this scheme is called Euler’s Method. 

Solving differential equations by numerical methods is something of an 
art. The idea of the basic Euler method has been modified and improved in a 
variety of ways to improve accuracy, numerical stability, and speed. Runge- 
Kutta methods, the Bulirsch-Stoer method, and predictor-corrector methods 
are some of these variations. None can be said to be “the best.” The choice 
of method depends on the nature of the equation, the nature of the boundary 
conditions, etc. 

There are now a number of computer software packages that solve differ- 
ential equations by these or other methods. Some packages allow the user to 
choose among several available methods. In others, the solution occurs in a 
“black box.” 


3.6 Problems 


1. Computer Project 1 (see Appendix) 


2. Computer Project 2 (see Appendix) 
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Chapter 4 


One Dimensional Motion 


4.1 Definitions and Theorems 


Consider a particle which moves in a one-dimensional region of space under 
the influence of a force F. Consider two points, a and 6 in the region of space. 


4.1.1 Definition 


Work, 
b 
We = / Fax 
4.1.2 Definition 
Kinetic Energy, 
(1/2)mv”, 


where m is the mass of the particle and v is its speed. 


4.1.3. Theorem 


Let a and 6 refer to any two points on the trajectory of a particle. The work 
done between these two points by the force F(x, v,t) is equal to the increase 
in the kinetic energy of the particle. (Note: The force may depend explicitly 
on position, velocity and time.) 

Proof: By definition, 


b 
Wes | Fdr. 
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But Newton’s Second Law in one dimension allows us to replace F’ with 
mdvu/dt. Moreover, we can replace dx with (dx/dt)dt, so that we have, 


sales de oe it = - moat 


b 1 
—(= d(—mv’). 
-[5 dt (smd t= ff (gm) 
Since the argument of the integral has now been reduced to a perfect. differ- 
ential, we may integrate by inspection, 
1 


1 
Wap = aus _ aa =T, —Ty,. 


QED. 


4.1.4 Definition 


Potential Energy, 
Viet): 


If the force is a function of x and t but not the velocity v, and if it is possible 
to write 
OV (x, t) t) 

Ox 
then V (x,t) is the potential energy of the particle. (Note: Since the derivative 
of a constant is zero, any constant may be added to V(z,t) without changing 
the derived force.) 


i — 


4.1.5 Theorem 


If the force F(a) has no explicit time or velocity dependence, then 


dV (x) 
Biss = 
(2) =-—, 
and, 
Wap = —V, + Va. 
Proot: 
b b 
Wa= [ hE [ da = - i Vy 
QED. 
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4.1.6 Theorem 
If F(a) has no explicit time or velocity dependence, then, 
BST Vay + Ws 


i.e., the energy, F = T+ V is conserved and F'(x) is said to be a conservative 
force. 
Proof: Under the given conditions, we have both that, 


War = Th - Ts 


and, 
Wap = —V + Va 


If we subtract the second from the first, we conclude, 
Ty, + Vo = Ta + Va. 


QED. 


4.1.7 Theorem 


If the force F'(z,t) has explicit time dependence, we may still define EF = 
T+ V, but 


ab _ ov 
dt = Ot 
Proof: Since V depends both on position and on time, we have, 
OV 
Pres eS 
Ox 
and, using the chain rule, 
OV OV OV 
= —dt = —F — dt. 
dV An dz + oy dt dz + oy dt 


We can no longer write, 


p bdV 
Was = | Fade = — [ oe ey ay 
a a dx 
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because the middle step no longer holds. However, we can still define E = 


T+ V. Then, 
dE dl. OVdx OV 
Pie ae aer raed 
-“ OV 
= mzé — (mz)z + Oe 
Hence, 
dE OV 
dt Ot 
QED. 
4.2 Problems 


ile 


If the force acting on a particle is conservative, i.e., a function only of 
position and expressible as 


show for the one-dimensional case that a formal solution of the equation 
of motion is, 


x dx 
t—o= sss 
ie j (2/m) |B — V(z)] 


where Zo is the value of x at to and F is the total energy of the particle, 
E=T+V. 


. How long would it take an object of mass m that is released at rest from 


the position of the earth, r., to fall into the sun? The sun represents 
a fixed center of force that attracts the object according to the inverse 
square law F'(r) = —k/r?. Show that the time required for the object 
to reach the origin at the sun is 


3 
t= (8), 


8k 


What did dimensional analysis tell you to expect (see Appendix B)? If 
the object is the earth, how long does it take? 
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3. A particle of mass m is initially at rest. A constant force Fo acts on 
the particle for a time to. The force then increases linearly with time 
such that after an additional interval to the force is equal to 2Fo. Show 
that the total distance the particle goes in the total time 2tg is, 


13 

e ote /m. 
(Hint: Watch your constants of integration.) (G. R. Fowles, Analytical 
Mechanics, Holt, Rinehart, Winston, 1970.) 


4. A boy runs and then slides on some slushy ice. It is hypothesized that 
friction varies as the square root of the speed, F(v) = —cv'/?. The 
initial speed of the boy is vp at time t = 0. What does dimensional 
analysis give for a length scale for the problem? Find the values of v 
and x as function of the time t. Partial answer: 


1/2 2 


cu C 
x = vot — ( a \t? + ne 


fae 


Show that the boy cannot travel farther than 


Use some reasonable values of m, vp and naz to estimate c in SI units. 


5. A particle moves upward in a medium in which the frictional force 
is proportional to the square of the velocity. What does dimensional 
analysis give for a length scale? Find the velocity as a function of 
height. (Hint: Solve the differential equation for <? as a function of 
x.) Find the maximum height to which the particle will rise for a given 
initial velocity. 


6. Show that the time average (over one period) of the kinetic energy of a 
harmonic oscillator is equal to the time average of the potential energy. 
Hint: By definition, 

Jeeag Tdt 


Tavg = : 
: Jsceana dt 
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7. 


10. 


A linear oscillator consists of a mass m attached to a spring of force 
constant k. The oscillator experiences a viscous damping proportional 
to the velocity. The oscillator is initially quiescent. At t = 0, a force 
of constant value Fo is applied. Before calculating, describe as best 
you can what you think the motion will be. Find the motion of the 
oscillator. Show your final answer as one expression with arbitrary 
constants expressed in terms of initial conditions. Don’t forget the 
particular solution and the homogeneous solution. Convince the reader 
that your answer is correct. 


. Find the Fourier series expansion of the function, 


F 
F(t) = wt — Fy 
Tv 


O<wT < 2n. 
Ans: 


EG 
F(t) =—- >) —sin nat. 
(1) » mm 
e Write down a particular solution, with any constants expressed 
explicitly in terms of w, k,m,b, and Fo, to the equation 


oo. OF 
mz + br + kx = —S> =" sin nut. 
rene! 


e What is the solution to the homogeneous equation in terms of 
two arbitrary constants? (You need not evaluate the constants in 
terms of initial conditions.) 


Find the one-dimensional motion of a particle if its potential energy is 
given by 
V(a) = —kox 
a <0 
Le. 29 
Vig= ahi 
ee oa 8) 


Let the initial displacement be x2 > 0 and the initial velocity be zero. 
Before calculating, describe the motion you expect. Where will the par- 
ticle move slowly, where more quickly? Where are the turning points 
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(i.e. points where the velocity vanishes as the particle changes direc- 
tion). What is the period of the motion? The period is just twice the 
time it takes to get between turning points. Partial answer: 


2 
Period = z + tise 
7) 


Kig 
11. The potential energy of a particle moving in one dimension is 
1 k 
V(x) = kya? + =, 
2 x 


with ky > 0, k2 > 0, and x > 0. What do you guess the frequency will 
be for small oscillations. Show that the motion is periodic. What is 
the equilibrium position? What is the frequency of the motion if the 
amplitude of the vibrations is very small? 


12. In a particular problem dealing with the orbit (i.e., periodic motion) 
of a particle, the approximate differential equation of motion (obtained 
by doing a Taylor’s expansion of the potential function) is found to be 


dx 3 4 
1+ a7)\—~ 4+ 3(@ — 21) —- —(2@ — 2)? + G(x — 21) = 0. 
(1+ 08) SG + S(@— 21) - 21)? + Sle) 
In this equation a and x, are known constants. The variable x is de- 
fined, x = 1/r where r and @ are the plane polar coordinates of the 
particle. Using results from our summary of solutions to differential 
equations, show that 


1 1 1b? 1b? bar? 
eee ae —~ 1 ¢05(2 a. 
as + 5 + b; cos(\@) ; cos(20) + 16 cos(3\6) 


e What is \? equal to in terms of 7, and b,? 
e Which is the constant in this result to be specified by initial con- 
ditions? 


e Why are there not two such arbitrary constants? 


13. Since both the Coulomb force and the force of gravity obey inverse 
square laws, they share another feature in common. Show by analogy 
that if Gauss’ Law for the Coulomb force is 


« | B-dA= | pedV, 
A Vv 
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14. 


15. 


then, there ought to be a Gauss’ Law for gravity, 
[gaa —4nG | OmdV. 
A V 


Assuming the earth to be uniform, show that if a straight hole were 
drilled from the North Pole to the South pole, a particle dropped into 
the hole would execute simple harmonic motion. Before calculating, 
guess what the period will be, in minutes. What does dimensional 
analysis give for the time scale of the problem? Find the period of this 
motion. Let a be the earth’s radius, G be the universal gravitation 
constant, and M be the earth’s mass. Ans: Period = 27(a°/GM)"?. 


Assume that the sun is at the center of a uniform, spherical dust cloud 
of density p. Find the law of force which binds the planet to the sun if 
the cloud extends beyond the limits of the orbits. (Ans: 

F GMoun. 4 


a = a Gee = gtGort. 


A pendulum swings with an amplitude of 20 degrees. What is the 
percentage error in taking the period using the simple harmonic ap- 
proximation? How many minutes a day error does this make? 


(Begin with the conservation of energy equation for a simple pendulum. 
Show that by making the definitions and substitutions, 


k? = E/(2mg), 
y = (1/k) sin(6/2), 


u = /g/tt, 


one may transform the energy equation into the form, 


(Hy =v) — Hy). 


This equation has a known solution called the Jacobian elliptic func- 
tion, y = sn(u). Show that when @ has its maximum value, Ymar = 1. 
Then show that the period of the pendulum is given by, 


= : dy 7 
A= Ato f (1 — y)(1 — ky?) EE) 
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16. 


bg 
18. 
1: 
a0), 
21; 


K(k) is called an “elliptic integral of the first kind” and tables of values 
are available in the Chemical Rubber Company’s Standard Mathemat- 
ical Tables. In particular, K (0.17365) = 1.5828. Assume ,/¢/g = 1.) 


A particle is placed at the top of a smooth hemisphere which is sitting 
convex side up on a table. As the particle slides down the sphere, will 
it leave the surface before hitting the table on which the hemisphere 
is sitting? If so, at what angle? The condition for the particle and 
hemisphere to be considered separated is for the force exerted on the 
particle by the hemisphere to vanish. Find this force as a function of 
angle. (It is not particularly helpful to use the Lagrangian formulation 
here. Simply resolve forces and use Newton’s Second Law in its more 
familiar form, F' = ma.) 


Computer Project 3 (see Appendix) 


Computer Project 4 (See Appendix 


( 

( 

Computer Project 5 (See Appendix 

Computer Project 6 (See Appendix 
( 


) 
) 
) 
) 


Computer Project 7 (See Appendix 
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Chapter 5 


Motion of a Particle in Two 
and Three Dimensions 


5.1 Definitions and Theorems 


Consider a particle which moves in a two or three-dimensional region of space 
under the influence of a force F. Consider two points, a and 6 in the region 
of space. 


5.1.1 Definition 


The net force is the vector sum of all forces acting on the particle. 


5.1.2 Definition 


Then we generalize the definition of work to be 


b 
Wav = | F- ds. 


5.2 Theorem 


Let a and 6 refer to any two points on the trajectory of a particle. The work 
done between these two points by the net force F(x,v,t) is equal to the 
increase in the kinetic energy of the particle. (Note: The force may depend 
explicitly on position, velocity and time.) 
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Proof: By definition, 
b 
War = | F -ds. 


But Newton’s Second Law in one dimension allows us to replace F with 
mdv /dt. Moreover, we can replace ds with (ds/dt)dt, so that we have, 


b 6 dv ds 6 du 
War = | B-ds= [ m2. Sat= [mv Sat. 
b : Ss River Fras my dt 


It is useful in the last step above to observe that 
Viv=v 


so that differentiating both sides with respect to t gives, 


We then have, 
Ww =[ eae at = [acm 2) 
ale) RT Ge mae eae a 


Since the argument of the integral has now been reduced to a perfect. differ- 
ential, we may integrate by inspection, 


1 it 
Wp = —mv; — —mv? = 7T, — Ty. 
2 2 


QED. 


5.2.1 Definition 


A region ¥ is simply connected if every closed path in © can be continuously 
deformed into a point without any portion of the path passing out of ©. 


5.2.2 Definition 


If in a simply connected region 


fF ds=0, 


then F is said to be conservative in ». 
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5.2.3 Theorem 
A necessary and sufficient condition for F to be conservative is that for two 


points, a and 8, 
b 
i F -ds 


be independent of the path connecting the two points. 
Proof: Let us assume that F is conservative. By definition, 


fP-ds=0, 


We may divide this path integral which closes on itself into two arbitrary 
pieces and write, 


b a 
(Path1) / F - ds + (Path2) i F ds =0. 
a b 
We may reverse the order of integration in the second (thus changing its sign) 


so that, 
b b 
(Path1) i F - ds — (Path2) / F -ds =0 


and conclude, 
b b 
(Path1) | F - ds = (Path2) i F ds = 0. 


Thus we have shown that if F is conservative, the work done in going from 
point a to point b is independent of the path. Since the steps of the proof are 
reversible, we also conclude that if the work done in going from point a to 
point 6 is independent of the path, then the force, F, is conservative. QED. 


5.2.4 Theorem 


A necessary and sufficient condition for F to be conservative in a simply 
connected region ™ is that 
Vx F=0 
in di. 
Proof: If we assume that F is conservative, we may apply Stokes Theorem 
as follows, 


PP -ds=0= [ (Vx F)- ado. 


67 


In this expression, 4, is a simply connected region in which there is a differ- 
ential patch of area, do. The unit vector, n, is oriented perpendicular to do. 
Let the path of integration implied in ¢ F-ds shrink until it just bounds do. 
Then, 

(V x F)- ndo = 0. 


Since do and n are arbitrary, we must conclude that V x F = 0 in &. Again, 
the steps of the proof are reversible and the vanishing of the curl is a necessary 
and sufficient condition for the force to be conservative. QED. 


5.2.5 Definition 


Potential Energy, V(x,t). If the force is a function of x and t but not the 
velocity v, and if it is possible to write 


F = —-VV(x,t) 


then V (x, t) is the potential energy of the particle. (Note: Since the derivative 
of a constant is zero, any constant may be added to V(x, t) without changing 
the derived force.) To define the potential energy, we must have, 


1. F must be conservative in a simply-connected region, ©. 


2. The potential energy function must be given some arbitrary value at 
one point in the region which we designate here as point a. 


3. The potential energy at all other points in the region (of which 6 is one 
such point) is then defined 


from 


b 
Ww=Ve- f Feds: 


Thus we have, 
Wa = —V, +e Ves 


5.2.6 Theorem 


If F(x) is a function of position only and conservative in 4, then energy is a 
constant and is said to be conserved, 


E=T+-+V. 
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Proof: Under the given conditions, we have both that, 
Wa = Th — Ta, 


and, 
War = —V + Ve 


If we subtract the second from the first, we conclude, 
Betty = tact Va 


QED. 


5.2.7 Theorem 


If F is conservative in a simply connected region, then F = —VV where V 
is the potential energy function. 

Proof: Consider two points which are a differential distance, ds, apart in 
yu. From 


b 
Vj=Va- | F-ds 


we have that 
F-ds=V, —V, = —dVv. 


From the definition of the direction derivative, we have 


dV = VV -ds. 
Thus, 
—F-ds=VV -ds, 
from which we conclude, 
F = -VV. 


QED. 


5.2.8 Definition 


The Lorentz Force is the force exerted on a charged particle (of charge q) by 
a combination of electric (E) and magnetic (B) fields, 


F = q(E+v xB). 
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5.2.9 Theorem 


The magnetic field does no work on the particle. 
Proof: Let the Lorentz force be the net force on a particle. Since dW = 
F - ds, 


WwW ds = 
do dt.” 
Thus, we have, 
i Gans ee eee LS a at 
Goa a 
=ma-v 
_ aw 
dt 
So, 
dW aT 
Ee a ma v= adE+vxB)-v=qE-v 


which does not depend on the magnetic field so that the magnetic field does 
no work on the particle. QED. 


5.2.10 Theorem 


If the electric and magnetic fields are time independent, 7’+ qw is conserved. 

Proof: If the electric field is time-independent, it may be derived from an 
electric potential function, E = —Vw. Using a result from the proof of the 
previous theorem, 


dT dW 
eS ee gE -v = —qVw-v. 
But, dw = Vw - ds (chain rule.) Thus, 
dp 
dt bee ae aa 
Therefore, 
d 
T =; 
qi tm) = 


or, 7’ + qw is a conserved. QED. 
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The potential energy, V, of a charged particle in an electric field is qw, so 
this is equivalent to saying that the energy, E = T’+V, is conserved. However, 
it is not true that the Lagrangian is given by L = T’—V in this case, since the 
Lagrangian must produce both the forces exerted by the electric as well as 
the magnetic fields. While it goes beyond the scope of the present discussion, 
we note here for completeness that the correct Lagrangian is, 


L=T-q-+qA-v), 


where A is the vector potential which yields the magnetic field according to 
B=VxA. 


5.2.11 Definition 


Angular momentum, 
L=rxp. 


5.2.12 Definition 


Torque, 
N=rxF. 


5.2.13 Theorem 


If the torque on a particle vanishes, the angular momentum of the particle 
is conserved. 

Proof: We may differentiate the definition of angular momentum to ob- 
tain, 


ue dr dp 
dea» dt 


=vx(mv)+rxF=mvxvtN. 


Since v x v = 0 and since the torque, N, is assumed to vanish, we conclude 
that 


dt ’ 
i.e. we conclude that the angular momentum is a constant and is conserved. 


QED. 
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5.2.14 Theorem 


In a system with only conservative forces, the explicit absence of a coordinate 
dq: in the Lagrangian implies that OL/0Oq, is a conserved quantity of the 
motion. Observe that the explicit absence of a coordinate in the Lagrangian 
results from a spatial symmetry of the problem. Thus, there is a direct and 
immediate connection between symmetry and a conserved quantity of the 
motion. 

(Note: In this and subsequent sections, we will attach subscripts to the 
generalized coordinates rather than superscripts. The use of superscripts is 
not particularly useful here since we no longer are concerned with maintaining 
a distinction between covariant and contravariant components. ) 

Proof: Lagrange’s equations apply to a system with only conservative 


forces, 
dOL OL 


Man qe 
If the Lagrangian has no dependence on a particular coordinate, q,, the 
second term of equation k vanishes and we have, 


d OL | 

dt Ody 
We conclude, therefore, that OL/Oq, is a constant with respect to time and 
is conserved. QED. 
5.2.15 Definition 


Consider the Lagrangian to be a function of generalized coordinates, g;, and 
the corresponding velocities, q;, and of time, t. Hence, L = L(qi,q,t). The 
generalized momenta are then defined to be, 

5.2.16 Definition 

The Hamiltonian, H, is defined to be, 


H =) pidi — L = pigi — L. 
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To complete the definition, however, the g; must be replaced with expressions 
involving q; and p; so that H = H(p;,q;,t). The definitions of the generalized 
momenta are used to make this replacement. 


5.2.17 Theorem 


The differential equations of motion of the system may be written, 


eo OH 
Pi ~~ Ogi ’ 
. OH 
Gi ~~ Op,’ 
OH... OL 
ot Ot 
Proof: 
1. By definition, 
_ OL 
OG: 
2. From Lagrange’s equations, 
dOL OL 
dtdgi Ogi 
lees 
d ; OL 


Pi) =e Dae 
Observe how the dot moves back and forth in the expressions for p; 
and 7;. 
3. By definition, 
A= pigi — L(G, %,¢) 
where a sum is implied by the repeated index in the first term. Hence, 
OL OL 


Are OL ,. 


OL OL 
= (pi — i + qidp: — (pi)dg; — dt. 
(Pi — Be )dai + dies — (Pid: — 
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The coefficient of dq; vanishes by the definition of generalized momenta. 
We also used the above result from Lagrange’s equations to replace 
OL/0Oq, with p;. Thus, on the one hand, 


DL 
dH = gdp; — pidq — oat 


4. On the other hand, consider the Hamiltonian as a function of general- 
ized coordinates and generalized momenta, H = H(p;,q;,t). Then 


OH OH OH 
A SS S=——> 4 a 4 "aes . 
d cm dp; + Dai dq; + aE dt 


5. Comparing the two expressions for dH, we conclude, 


OH 

Pe Oa 
__ OH 
Ti = Op,’ 

OH OL 

‘Ott 


These equations are called Hamilton’s Equations. Observe that they are 
first order rather than second order (as obtained from the Lagrangian). Ob- 
serve also that p; and q; play very symmetrical roles that blur the distinction 
between coordinates and momenta. This is particularly so for generalized 
coordinates and momenta that may not even have the units that we ordi- 
narily associate with Cartesian coordinates and momenta. In a system with 
n degrees of freedom (requiring n coordinates for its description), the phase 
space of p;, gq; has 2n dimensions. 


5.2.18 Theorem 


If OL/Ot = 0, then the Hamiltonian is a constant of the motion. 
Proof: In general H = H(p;, q;,t). The chain rule yields, 
dt Op; dt  Oq, dt Ot 
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= (a)(6:) + (oy a) + (2. 


The first two terms add out and the third vanishes by assumption. Thus, 
dH 
en | 
dt 
and H is a constant of the motion. QED. 
5.2.19 Theorem 


If OL/Ot = 0 and the system is conservative, the Hamiltonian can be inter- 
preted as the total energy of the system, 


di Se, 


Proof: We may write the kinetic energy of the particle as 


1 en 
T= gh IIUG- 


From the definition of generalized momenta and given that the system is 
conservative so that the potential energy is a function only of the generalized 
coordinates and not the velocities, 


If we now form the Hamiltonian, 
Thus, H =7T+V. QED. 


5.2.20 Example 
Consider the Lagrangian for a two-dimensional harmonic oscillator, 
1 1 
Lat LVS ue +77) 5 (e 9?) 


The generalized momenta are, by definition, 


aL 


7 Ox 
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Py = Oy = my 
The Hamiltonian is 
ae eR 1, ; 1 
H = (mx)(x) + (my)(y) — ace ty) 4 5 (e ty”) 
Thus, 
1 1 
H=T+V=5m(#+H)+5@ +9’), 


However, we must express the Hamiltonian in terms of generalized coor- 
dinates and generalized momenta, i.e., 


1 
H = ——(p2 + py) + =(2? +9’). 
m 


Hamilton’s equations for this system then become, 


OH 
= —-S> = ke, 
P Ox 2 
OH 
= —-— =-k 
Py Oy Y; 
ps OH _ Dz 
— Op, mM’ 
« OH Dy 
Ua 
Py m 


In this form, these first-order differential equations would lend themselves 
to direct numerical solution by computer. However, we can combine them 
to put them into more familiar second-order form. Differentiate the latter 
two with respect to time and substitute into the former two, eliminating 7; 
in the process. We then obtain the second-order, two-dimensional harmonic 
oscillator equations that you would obtain from the Lagrangian. 


mi +kx = 0, 


my + ky = 0. 


Although Hamilton’s equations can be used readily in place of Lagrange’s 
equations to obtain the differential equations of motion, they are more use- 
ful in supplying fundamental postulates in quantum mechanics, statistical 
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mechanics and celestial mechanics. Although quantum mechanics cannot be 
derived from classical mechanics, the Hamiltonian nevertheless provides a 
bridge between the two. For example, a certain well defined substitution 
turns the Hamiltonian of classical mechanics into the Schrodinger equation 
of quantum mechanics. 


5.3 Problems 


1. If you shoot a cannon uphill, how should you aim to maximize your 
range on the hill? Imagine that the angle @ characterizes the slope 
and @ is the angle of elevation relative to the slope. Find the angle y 
relative to the horizontal, i.e. y = 0+ @ that maximizes the range up 
the hill. Is your answer reasonable if @ = 0? What is the range up the 
hill? Are your answers reasonable if 3 = 1/2? 


Partial answer: R,, = (v2/g)/(1 + sin @). 


2. A gun at the origin of coordinates has a muzzle velocity of 1600 ft/sec. 
An airplane travels on a course given by x + z = 3.5 x 10* ft, and at 
a constant altitude y = 104 ft. Over what portion of its course is it in 
danger? (T. C. Bradbury, Theoretical Mechanics, John Wiley & Sons, 
1968). 


3. Consider a tether ball held at chest height with rope extended. Find 
the equations of motion, the energy equation and define an “effective 
potential” for the motion. How fast (feet per second) do you have 
to propel it horizontally (with the tether fully extended) to keep it 
from falling below your knees? As the ball revolves around the pivot 
(assuming the tether keeps the same length) it will oscillate between 
its highest point and its lowest point. Estimate the frequency of this 
latter part of its motion. (Let’s take chest height to be five feet and 
knee height to be two feet. Let the length of the tether be 8 feet and let 
the ball just reach the ground when it is at rest. Let’s assume the ball 
weighs about a pound and the weight of the tether can be neglected.) 


The purpose of the problem is to demonstrate how an essentially two- 
dimensional problem can be reduced to a one-dimensional problem us- 
ing conserved quantities, one of which can be identified by symmetry 
with respect to a coordinate in the Lagrangian. The solution to the 
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problem will require a Taylor’s expansion about the minimum of the 
“effective potential.” To find the position of the minimum of the po- 
tential for purposes of the expansion, you will be forced to a numerical 
rather than an analytical solution. When you have finished the prob- 
lem, you ought to review it to identify the techniques you have used 
that have more general applicability than this specific problem. The 
frequency asked for in the problem is found by reducing the problem 
by approximation to the simple harmonic oscillator. 
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Chapter 6 


Accelerated Frames of 
Reference 


We have already derived the following results: 
de, 
dt 


Nn 
wxe = OijkOjWp- 


vevitwxr'+Vo. 


a=al+2Qwxvitwxr'+wx (wxr’)+ Apo. 
Therefore, Newton’s Second Law can be written, 
F=ma=m(a'+2wxvitwxr't+twx (wx r’) + Ao) 


Written this way, the second, third and fourth terms inside the parentheses 
on the right-hand-side are known as the Coriolus acceleration, the transverse 
acceleration and the centripetal acceleration respectively. If, on the other 
hand, one would like to preserve the form of Newton’s Second Law for ob- 
servers working in a non-inertial frame, F = ma’, then one may rewrite the 
above equation in the equivalent form 


F — (Qmw x v' + mw xr’ +mw x (w xr’) +mAg) = ma’. 


In this form, we refer to a Coriolus force, transverse force, and a centrifugal 
force. The extra forces now on the left-hand-side are fictitious forces arising 
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from the choice of a noninertial frame in which to work. In introductory 
courses in mechanics, instructors often go to great lengths to wean students 
from the notion of a centrifugal force, but as we see here, there is a perspective 
in which it is useful. 


6.1 Theorems 


6.1.1 Theorem: Lagrangian for Motion in a Noniner- 
tial Frame 


The general Lagrangian for motion in a noninertial frame is, 
1 1 
L=T-V+ sir — gine’ -w)? —m(r’ + ag) + mv! - (w x 7’) 


Proof: We shall obtain this result by trial-and-error. We observe that the 
terms in the Lagrangian are scalars or pseudoscalars. We simply wish to add 
additional terms to the Lagrangian so that Lagrange’s equations produce the 
non-inertial forces which arise in a non-inertial frame. The non-inertial forces 
are, 

Fy; = 2m(v! x w) + m(r’ x w) + m(w x (r’ x w)) — mao. 


We shall identify these terms as follows: 
I = 2m(v' x w) 
Il = m(r’ x w) 
III = m(w x (r’ x w)) — mag. 


The term III depends only on position and is thus similar in nature to a 
conservative force. Thus, we seek a scalar term wy;; that can be subtracted 
from the Lagrangian in much the same way that V is subtracted from T. 
Thus, we seek a function wy ;; such that, 


OW 
Oxi 


Using the so-called B(A -C) — C(A - B) rule, we can write (taking the i-th 
component after applying the rule), 


— 


/ / / 2 / 
III; = rw” — WW; — Agi = Lj0jiW — WE Wi — Aoi 
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= x (Sijuu? <= Wi W;) — aoi- 
It is straightforward to show that a suitable scalar function is, 


12, 2 


1 1 1 1 
Vu = — 5 Sig” + = (aw) (xjwi) + vjaoj = —5rhw + 5 -w)? +9 - ag. 


2 

The Coriolus force is a little harder. Now we seek a scalar that involves 

both the velocity of the particle and the angular velocity of the frame, but 

which somehow also yields the cross-product of the Coriolus force. About 
the simplest thing that one could try is, 


wr = mv’ . (w x r’) = ME, OijghWj Ly 
Then, if we apply the Lagrangian operator, 


doy, Oy 

abog, Or; 
we have three terms. The first is a pleasant bonus. It is just negative of the 
[-th component of the transverse force which we need anyway. (Observe that 
Lagrange’s equations are in the form mQ, — F; = 0, so that the operator 
is supposed to generate the negative components of the non-inertial forces.) 
The second looks like the negative of the /-th component of the Coriolus force 
but lacks a factor of two and a minus sign to be correct. The third term, 
except for some manipulations of the indices, is also a Coriolus force term 
which combines with the second term to provide the missing factor of two. 
Thus, we have, 


< / -/ -/ 
= MOjRWjL;, + MO jRW;L;, — ML, Oi;1W;, 


1 1 
L=T-V—-y74+ 1 = T-V+5mr"w*—Sm(r!-w)’—m(r'-ap)+mv"-(wxr'). 
QED. 


6.1.2 Theorem: Lagrangian for Motion in a Noniner- 
tial Frame 


In problems describing particle motion in a rotating (noninertial) frame where 
w is small (terms proportional to w? can be ignored) and ag = 0, the La- 
grangian becomes: 

L=T-V+m(wxr’)-v’ 
where w is the angular velocity of the rotating frame, r’ is the particle position 
in the frame, and v’ is the particle velocity with respect to the noninertial 
frame. 
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6.2 Problems 


1. A bicycle travels with constant speed v around a track of radius p. 
From the point of view of the inertial frame, 


/ 
a=at ACoriolus <7 Acentripetal + Atransverse 1 ao. 


Evaluate each of these terms for a point on the rim at the top of a 
vertically-upright wheel of radius b with body axes attached to its cen- 
ter and the z’ axis always vertical (thus not rigidly attached to the 
wheel). The purpose of the problem is to give you experience in evalu- 
ating the different terms in the general expression for acceleration used 
in noninertial frames. 


Ans: 
2 


Ah 
eee’ Aan 


BU. - 
=> — XxX 
p b 


a 


2. A bug of mass m crawls outward, with constant speed v, along the 
spoke of a wheel which is rotating with constant angular speed w about 
a vertical axis. Find all the forces (real and fictitious) as seen from the 
point of view of the bug and show them on a diagram. The purpose 
of the problem is to allow you to demonstrate that you understand 
how “fictitious” forces arise in noninertial frames and how they are 
interpreted. 


Ans: 


If the z’ axis is vertical and the bug walks outward along the 2’ axis, 
then 
F = 2mwoy’ — mw? 2x’. 


3. A projectile is fired due south from a gun located at 45° latitude. The 
velocity of projection is 100 ft/sec and the angle of elevation of the 
gun above the horizon plane is 45°. Find the correction to the point of 
impact due to the Coriolus force. 


The problem is intended to demonstrate the Coriolus effect, but also 
to demonstrate how differential equations may sometimes be solved us- 
ing “perturbation theory.” In this problem w is “small.” We therefore 
expect the Coriolus force to result in a small correction (which we call 
a “first-order” correction) to the motion we would otherwise expect in 
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the absence of w. We use the presence of w in the various terms of our 
equation to decide what to keep and what to ignore. If w is small, then 
any terms involving w? (or higher powers) are much smaller and can 
safely be ignored in the approximation. To apply perturbation theory, 
write down Newton’s Law for the problem in a noninertial, Cartesian 
frame on the surface of the earth. Include any terms involving one 
power of w, but discard any with higher powers of w. This should leave 
you with the Coriolus term, but should allow you to discard the cen- 
trifugal term. Now assume that x(t) = xo(t) +wai(t) +... (and similar 
expressions for y(t), z(t)). Substitute these forms into the equations of 
motion and separately equate terms that have no w in them and then 
separately terms that have a single power of w. Discard any terms that 
have higher orders of w. Now systematically solve first the zeroth-order 
equations and use these solutions to solve for the first order corrections. 
When you have completed the problem, review what you have done and 
try to identify techniques that have more general applicability than this 
particular example. 


. A ten-ton freight car is headed due north at 45° latitude on the earth’s 
surface at a velocity of 100 ft/sec. What is the sidewise force on the 
railroad tracks due to the Coriolus force? (Note: This is a very simple 
illustration of the magnitude of the Coriolus force. You will not need 
perturbation theory to solve the problem.) 


. The acceleration due to gravity at the equator of the planet Uranus, 
exclusive of the centrifugal acceleration, is about 950cm/sec?, a value 
which is close to that on earth. The period of rotation of the planet 
is quite short, being about 10.7 hour. The mean radius of Uranus 
is 23.8 x 10° km or about 3.73 earth radii. What would be the actual 
measured value of the acceleration of a falling object at the equator (in- 
cluding Coriolus and transverse accelerations but not centrifugal which 
is second order small for small w)? A rocket is to be fired vertically from 
the surface of Uranus at the equator. What is the minimum velocity 
for escape measured relative to the planet? What would be the initial 
direction of the rocket as seen by an observer in an inertial frame? (T. 
C. Bradbury, Theoretical Mechanics, John Wiley & Sons, 1968). (Note: 
While the centrifugal term is proportional to w? and can be neglected 
in this problem, the term we associate with the symbol ag is also pro- 
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portional to w* but cannot be neglected for a noninertial frame fixed 
to the surface of the planet. Do you see the difference? The problem 
also illustrates the concept of “escape velocity,” i.e. the firing velocity 
such that the rocket would just come to rest at infinite distance. It is 
the velocity which makes the total energy of the rocket be zero.) 


Ans: 
g = 887cm/sec? 


° 
Ouniitisidiraction = LOS": 


. Find the Lagrangian for a satellite launched into an arbitrary orbit 
around the earth from the point of view of observers on the earth. Use 
spherical coordinates with origin at the earth’s center and fixed with 
respect to the earth. Is there a missing coordinate in the Lagrangian? 
If so, find a constant of the motion. What does the orbit look like for 
the special initial conditions, 6 = —w, i.e. how does the Lagrangian 
simplify in this special case and what kind of simplified motion does 
the Lagrangian now describe? 
Ans: 

LS sn(#? + 76? +r? sin? 0(¢ + w)*) + cmt 
. Experimental: Measure the period of the Foucault pendulum in the 
foyer of the Eyring Science Center and use it to estimate the length of 
the wire (in meters) that suspends the pendulum bob. Thus, estimate 
how high the building rises above the foyer since the top of the pen- 
dulum wire is fastened to a point just under the telescope dome at the 
top of the building. Then, by further observing that the pendulum’s 
motion has a slow progression of 9.65 degrees per hour, estimate what 
the latitude of Provo is. 


. Experimental: Experience the rotating room (also called the Vomit 
Vortex). The Vomit Vortex is a large metal cylinder that looks sort of 
like two livestock watering troughs stacked together to make a round 
room. The room contains a bench, a table, a pendulum attached to 
a magnet that sticks to the ceiling, and a box of balls of various sizes 
and densities. Get a group of 3-6 people together, find either your 
instructor or a designated alternate so that the room can be located, 
unlocked, and plugged in. Do the following experiments and write a 
short report describing what you did. 
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e Have someone stand outside and measure w, magnitude and di- 
rection, for the room. 


e Measure the deviation of the hanging pendulum from the vertical 
and check to see if your measurement agrees with what you would 
expect from the centrifugal term in the acceleration expression for 
non-inertial frames. 


e Calculate the direction of the Coriolus force and verify, by playing 
catch with each other in the room, that your calculated direction 
is correct. 


e Think up something creative to do in the room and see if you can 
explain what happens. 


9. Computer Project 11 (See Appendix) 
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Chapter 7 


Systems of Interacting Particles 


We may sum over particles to extend definitions to systems of interacting 
particles. 


7.1 Definitions and Theorems 


7.1.1 Definitions 


F = [> F, = S- |e ae S- Boers 


p= >) ps 
£=°> ly = ta * Da 
N= )-Na= ora x Fa 


7.1.2 Theorem 
F=p 
Proof: For each particle, we have Fy = py. Thus, by definition, 
Pe ee 
dt dt 
QED. 
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7.1.3 Theorem 


N=2 


Proof: For each particle, we have 


Ne = ee 
Thus, 
Ned Ne= tos Pa he Xx Da 
d : 
meer (Ta X Pa) — 9 fa X Pa- 
But, 
To X Pa = Mala X Tq = 0. 
So, 
d d d . 
Se (Ta X Pa) = Gp Diba = Gl = fe. 
QED. 


7.1.4 Definition 


An isolated system is defined to be a system on which there are no external 
forces of any kind. For such a system, the total linear and angular momentum 
is assumed to be constant. 


7.1.5 Theorem 


For an isolated system, 
S- Beers =i) 


SS Nerne =i) 


Proof: If the total linear momentum of the system is assumed to be 
constant, we have, 


p —-Q)=F= S- F, _ S- pee it st pee 
But, the sum of external forces vanishes for an isolated system. Hence, 


S- eae =i 
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QED. 
Similarly, 


Na =T, X F, =r, x (ee ah Fo") = Ni” ai Ne 


If the total angular momentum of the system is assumed to be constant, we 
have, ; 

£=0=N=S N+ 5 °N@". 
If there are no external forces of any kind on an isolated system, the external 
torques must vanish because the external forces vanish. Thus, 


NON =, 


QED. 


7.1.6 ‘Theorem 


If the net force on a system is zero and if the net torque about any one point 
is known to be zero, then the net torque about any other point is necessarily 
Zero. 

Proof: We assume that the net torque vanishes relative to some point, O. 
Thus, relative to this point, Sr, x F, = 0. Let a point O’ have a position 
R relative to O, so that r, = R+r’,. We then have, 


0=S(\(R44r)xF,=Rx>OFL+ > mx Fa. 
But the sum of the forces vanishes, so, 
Si, ee 0 


i.e. the net torque about O’ vanishes. QED. 


7.1.7 Theorem 


The work done on a system of particles by the net force (including external 

and internal forces) equals the change in kinetic energy of the system. 
Proof: Let us assume that a general displacement of a system is made 

so that each particle is displaced by an amount ds,. We assume that each 
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particle has a net force, F,, acting on it (which includes both internal and 
external forces). The work done is, 


_ fe ON 6 gph en Ni 9 
dW =S °F, - dso = > ma Fr Vadt = (Dd 5 Maa dt 


1 
= 5° d(sMava) 
=e Hae 
QED. Observe that > F’”* = 0 does not imply that > Fi” - ds, = 0! 


7.1.8 Theorem 


If the forces on a system are conservative, the work done on a system of 
particles by the net force is equal to the change in potential energy of the 
system. In such systems, energy is conserved, i.e., EF = 74+ V. 

Proof: If the forces on the system are all conservative, we assume that the 
forces on each particle may be written F, = —V,V, where V is a function 
of the 3N coordinates of the N particles, 


V= V(21, U2, U3, U4, ++, 3Nn) 


and V, is the gradient with respect to the coordinates of a given particle. 
Using the fact that dV, = V,- dS, we have, 


dW = °F, + d8q = —S>VaV «ds = — > dV, = —dV. 


Thus, dW = —dV. QED. 

For conservative systems, we have, therefore, dW = dT’ and dW = —dV. 
Subtracting the two expressions, we have d(7’+ V) = 0, from which we 
conclude that 7+ V is a constant to which we give the symbol E. Thus, 
E=T+YV. QED. 


7.1.9 Definition 
The position of the center of mass R, is defined by 


M=S>m. 


R,. = ~ se Mole: 
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7.1.10 Theorem 


pH= MV 
where, 
dR. 
Nee : 
dt 


Proof: By definition, 


R, = a S- Mola: 


If we differentiate this expression with respect to time, 


dR, 1 : 1 
Gem ag Dee ay 
Hence, 
p= MV.. 
QED. 
7.1.11 Theorem 
F= Ma, 
where, 
1 a We 
te 


Proof: If we differentiate p= MV, with respect to time, 


dV. 


P dt 


= Ma.,. 


QED. 
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7.1.12 Theorem 


If r’, are the position vectors of particles relative to the center of mass, then, 


> Met = 0: 


Proof: Relative to the center-of-mass, the position vector of the center- 
of-mass vanishes. Thus, if r/, are measured relative to the center of mass, 


R,. = a7 Mata = 0, 


which implies that > mar’, = 0. QED. 


If vy = V.+ vi, where v/, =r, then, 
Sev, =0. 
Proof: Differentiating {mar!, = 0 with respect to time, we have 


Nev, = 0. QED 


7.1.13 Theorem 


The total angular momentum of a system with respect to a fixed point is 
given by: 
£=R,x MV.4+ (rt, X Maeva), 


L€., 


L= lon t £. 
Proof: Let ra = R.+1r/,. Then, by definition, 


£=S org X MaVa = > (Re + 14,) X Mo(Ve + Vy) 


= (R. x Ve >) ma + Re x (05 Mav) +O mar) X Vet Dory x MeV. 


However, since > mv, = 0 and } mar’, = 0, we have, 
£=R, x MV.+ > 4), xX MaVy = bem + £. 


QED. 
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7.1.14 Theorem 


The kinetic energy of a system can be decomposed, 
1 2 1 12 
(i 5MVe he ye gMava: 


Proot: 


1 1 
‘ey xMava =e gimalVe +v!)-(Ve+ vi) 


= 5 LlmaV? + Veo Dd Maa + 5 mati? 


But, we have > myv/, = 0, so, 
T= 1 v2+ s aie ve 
2 . 7 eae 


QED. 
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Chapter 8 


Systems of Interacting 
Particles: Two or Three 
Particles 


8.1 Two-Body Central Force 


8.1.1 Definition 


A force that depends only on the distance between particles and is directed 
along the line joining the particles is a central force. Central forces are 
conservative. 


8.1.2 Reduced Mass 


In this section we will consider two particles that interact via a central force. 
As a type, we will use force that satisfies an inverse square law, i.e., the force 
on one particle exerted by the other depends inversely on the square of the 
distance separating them. However, in treating this specific choice of force 
law, we will use techniques that can be used with the more general family 
of force laws, k/r”. We will show how a “reduced mass” and constants of 
the motion can be used to reduce the two dimensional problem to a one- 
dimensional differential equation for motion of a particle in an “effective 
potential.” We will also show how the time-dependent differential equation 
can be converted to a differential equation for the orbit, i-e., an equation 
relating r and @. As you see these ideas introduced, remember that they are 
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more general than the specific (but very important) problem of the inverse 
square law of force. 

Locate a set of axes at the center of mass which will be some point on 
the line connecting the two particles. Then, relative to the center of mass, 


Mir, + More = 0 


L= ry, X MV, +12 X M2V2 


1 1 
T= xe al sinava: 
Define, 
r=f,—f9. 


Thus defined, r is the vector position of m, relative to mz. We can solve the 
two equations, 
mr, + Moo = 0 


r=fr,—fYro2 
to give, 
mg 
ry = ————_ 
my, + Mg 
™my 
ft) = —————r 
my + mg 


and, differentiating with respect to time, 


mg 
v= 
my + Mes 
™my 
V2 = —————-V. 
My -++ ms, 


We can then rewrite the angular momentum and kinetic energy in terms of 
r and v, 
mm ,™Ms9 
L=T ¥<—<—_——— 
my + Meg 
a 1 my z™Ms 2 
2 2 my,+™Ms 
If we define the reduced mass 
my z™Ms, 


>) 
my + Meo 
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we then have the the angular momentum and kinetic energy written in a 
form suggestive of single particle motion, 


€=rxmv 
T = —=mv’. 
2 


Note that if mz >> mj, then m & mj, re is small, and r; & r. Such is 
the case, for example, for the earth revolving about the sun and an electron 
revolving about a nucleus. 


8.1.3 Constants of the Motion and Effective Potential 


If we adopt plane polar coordinates, r and 6 for our system, we may write, 


B= Sn(i? + 7°68) + V(r) 


L= pm(i? + 1B) — V(r), 


Because we are assuming a central force law, the problem has symmetry in 
0. Therefore 6 does not appear explicitly in the Lagrangian. Thus, we have 
a constant of the motion which turns out to be the magnitude of the angular 
momentum, 


L : 
l= ae = mr?6. 
00 
We may use the fact that 
el 
err 


to eliminate @ from the equations describing the system. Lagrange’s equation 
for r is 


or eae ee 
Or 
Eliminating 0, we have 
iit = eC OV 0 
mr Or 
Ls es 
B= smi aca ate Vee 


The energy equation now looks very much like the equation for the one- 
dimensional motion of a single particle with coordinate r. Of course, we 
have grouped a piece of the true kinetic energy inside the parentheses with 
V(r) to make the problem look one-dimensional. The true motion is two- 
dimensional with r and @ both varying. But, given that we have made it 
look like one-dimensional motion, it is natural to define an effective potential 


Very 
2 


Snipe + V(r). 


The term from the kinetic energy that we have moved over into the effective 
potential gets very large for small r. For a fixed value of E, the presence 
of this term keeps the system away from sufficiently small values of r and 
prevents the two particles from coalescing. Hence we refer to this term as a 
centrifugal barrier. The use of an effective potential is a general technique 
that can be used whenever there is enough symmetry to reduce a multi- 
dimensional problem to a quasi one-dimensional problem. 

If the effective potential is shaped (as a function of r) such that it has 
a minimum and if F is greater than the minimum of V but EF < 0, then 
the particle is said to be in a potential well and the motion will be periodic 
increasing to a maximum 7g, and decreasing to a minimum 1,7. The min- 
imum and maximum values of r are those values when 7 vanishes and all 
of the energy is represented by the effective potential. If E is exactly equal 
to the minimum of the effective potential, then the minimum and maximum 
radii merge into one and the periodic orbit is circular. 


Verh = 


8.1.4 Orbit Equation 


There is a second path that we can follow. Consider, 
€=mr6. 


i sn(?? + r26?) + V(r) 


e OV 
—-—,=-—=F 
ars Or (r) 
Define x = 1/r. Then, 
dx i ldrdt af 


do. db redid 6 
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Thus, 


__ fae 
~  mdé 
and, 
P am al 
‘ 2” de? 
Our equations then become, 
: L - 
d=—2x 
m 
d?x m F(a) 
ie Be 
Lee ae ae 
= -—(— —4V 
3 maa? 2m +) 


In this latter form, the solution of the equations yields the orbit equation, 
i.e., r as a function of @. 


8.1.5 Inverse Square Law: Attraction 


We will now take as our force law, 


k 


re 


F, = 


If k > 0, the force is attractive, such as would be the case for the gravitational 
force (k = Gm,mz) or the electrostatic attraction of two opposite charges, 
(k = |Q,||Qe|). If k < 0, the force is repulsive. The inverse square law force 
is conservative and derivable from a potential, 


For the inverse square law the orbit equation becomes, 


x 2 mk 

—+2=-—. 

dé? ei 
This equation has a transient solution « = Acos(@ — 69) and a particular 
solution x, = mk/¢?. Thus, 


1 mk 
t= — = — + Acos(d — 60) 
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where A and 6 are determined by initial conditions. If we let 09 = 0, then 
r will be at its minimum value rin (also called the perigee or perihelion) 
when 6 = 0. Henceforth, we will assume this initial condition. If there is a 
maximum at all, Pmaz will occur when 6 = 7. 


If we let 
Al? 
a 
mk 
and, 
(2 
p= ae 
we can put our solution in the form, 
ae 1+ecosé. 
ip 


This is a standard form for a conic section (circle, ellipse, parabola, hyper- 
bola) with origin at the focus. In this form e is identified as the eccentricity 
of the orbit. The respective cases are: 


e=0: circle 


e <1: ellipse 
e = 1: parabola 
e > 1: hyperbola. 


The equation of the conic section can also be written in terms of EF and ¢ 
instead of rmin and e. We may solve for rmin (and Tmax if its exists) in terms 
of EF and ¢ from the energy equation. These turning points occur when 7 = 0. 
Then, 


Solving, 

k; ke es 
+ 4/ — + ——. 

2E 4 F? a 2nE 


An alternative is to solve for 2, = 1/rm from 


et om 


Ski, =i 
2m 
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yielding, 
1 = mk mk 20? F 


ae ae mk?” 


If this latter form is compared to 


Lmax = 


= A 
Tmin Ue 
we can identify, 
mk 202 E 
A= 
ee mk? 
and, 
7 i 22°F 
= mk? 
so that we can write the orbit equation in the alternative form, 
0? /mk 


r= 


a a 5 ee (20?) /(mk?) cos 6 
The respective cases for the conic sections are then, 


me 0: cirel 
= ——~ > e=0: circle 
WP € ci 
E<0-e< 1: ellipse 
E=0—-e=1: parabola 
E>0—-e>1: hyperbola. 

If E < 0 (ellipse) we can also write the orbit equation in terms of the 
semi-major axis a and the semi-minor axis b. For an ellipse useful relations 
are, 

Tmax = a(1 + e) 
Trin = all —e). 
e? = 1-(-)?. 
a 
Then, 
_ a(l—e?) | (b? /a) 
1+ecos@ 14/1 — (b/a)? cos. 
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Comparing to the earlier form in terms of F and @, we can identify, 


b? e 

a mk 
(ey _ _2CE 
a mk? 

from which we can easily show, 

ok 

aaa 5 
a 
—2mE 


We are now position to prove Kepler’s Laws: 


8.1.6 Theorems: Kepler’s Laws 
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1. Each planet moves in an ellipse with the sun as a focus. 


2. The radius vector from the sun to the earth sweeps out equal areas in 


equal times. 


3. The square of the period of revolution about the sun is proportional to 


the cube of the major axis of the orbit. 


Proof: We have demonstrated above that the orbits in a central, inverse- 
square law force field satisfy the standard form for an ellipse, 


D 


—=1+ecos0 
7? 


in the case that e < 1. This constitutes the proof of Kepler’s first law. QED. 
We have also demonstrated that the two-body system has a constant of 
the motion (angular momentum), ¢ = mr?6. If we write this as 


m 2m1 


dt = —r7d@ = —-=(r)(rd6), 


e £2 


We observe that 1/2(r)(rd0) represents a small element of area swept out by 
the radius vector of the position of the planet. (Think of a very long, narrow 
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triangle with altitude, r, and base, rd.) Thus, the radius vector sweeps out 
equal areas in equal times. This is Kepler’s Second Law. QED. 

Kepler deduced the first two laws empirically from detailed analysis of 
data gathered earlier by Tycho Brahe. They were first published in 1609. 
The third law was published later in 1618. Observe that Kepler’s Second 
Law is a consequence only of the central nature of the force and not of its 
inverse-square dependence. 

If we integrate Kepler’s Second Law over one period and use the formula 
for the area of an ellipse, we have, 


But, 


so we have, 
2m el 


poe. 
Ta av 


Since k = Gm,mz and m = (m mz2)/(m1 + m2), we have, 


Ara 


Ps, 
G(m, + mz) 


QED. 


8.1.7 Inverse Square Law: Repulsion 


If the inverse-square law force is repulsive rather than attractive, then the 
force constant k < 0. We must then modify our derivation of the orbits 
slightly by returning to 


———} Xx — ee 

dé? e 
and replacing & with —|k|. The explicit minus sign then results in our solution 
being written as, 

mk 


ar + Acos(@ — a). 


Consequently, we have 
a= —l1+ecos8, 
ip 


101 


where, 


and, 


mk2- 
In this case of repulsion we always have E' > 0 and e > 1; the orbit is always 
a hyperbola. 


8.2 Cross Sections 


We now turn to another use of two-body interactions that serves as a type 
for a much more general technique of physics. When physicists try to under- 
stand the atomic and subatomic structure of matter, they are faced with the 
problem that atoms are very tiny things indeed. Since the time of Ruther- 
ford, people have turned to scattering experiments to probe the structure 
and physics of interaction of atoms and elementary particles. In these exper- 
iments a beam of particles is projected onto a target and detectors observe 
the directions, energies, momenta, etc. of the particles that come out. 

What goes in is a beam, a stream J, of some number of particles per unit 
area per unit time. This is a directly measurable quantity. What comes out 
is a number of particles dN per unit time. This also is a measurable quantity. 
The ratio of what comes out to what goes in has the dimensions of [area] 
and is called a differential cross section., 


_ dN 
==. 


The cross section da may depend on the energy of the impinging particles, 
on the physical size of the projectile and target particles, on the energy, 
momentum, charge, spin, angle of scatter and type of the emerging particles, 
etc. In short, much of the physics of what went on in the interaction is buried 
in do. Theorists calculate cross sections and experimentalists measure them 
for comparison so that the theories can be sorted for validity. It would be 
hard to overemphasize the importance of the concept of a cross section in 
modern physics. 

Cross sections are defined for processes. If you project a beam of particles 
and something happens as a result, you can define a cross section for that 


do 
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process. We are going to concentrate on a particular process, but the ideas 
we use are much more general than the specific example. The process we 
choose is the scattering of alpha particles from gold nuclei, the process that 
allowed Rutherford to postulate the very existence of a nucleus. We will 
use reduced mass coordinates. Since the gold nucleus is much more massive 
than the alpha particle, the description is very nearly the scattering of alpha 
particles from a fixed center of force. 

Consider a beam of charged particles particles J impinging on a single 
charged target particle such as a nucleus in a very thin foil of gold. The 
particles emerge from the target deflected by an angle ¢ from their original 
direction. If we draw a line parallel to the beam running through the center 
of the gold nucleus, the beam will have particles whose line of approach is at 
varying distances from this line. The perpendicular distance from the center- 
line to the line of approach of each particle is its «impact parameter s. Particles 
with very large impact parameters do not get very close to the nucleus and 
are scattered very little. Particles with very small impact parameters come 
very close to the nucleus and are scattered through a large angle. Hence, we 
conclude that there is a functional relationship between s and @. Particles 
with impact parameters between s and s+ ds are scattered into a cone with 
angles between @ and ¢ — d@. The cone has a solid angle dQ = 27 sin ¢d¢. 

The process that we are trying to characterize is the scattering of alpha 
particles into the cone at angle ¢. All of the beam particles that pass through 
an annulus (while still very distant from the target) defined by 


do = 27sds 


will be scattered into the cone. The ratio da/dQ is the differential cross 
section per unit solid angle, 


do 27 sds fone 8 ds 
dQ  22xsinddd — sinddd’ 


(The negative sign is added because increasing s results in decreasing ¢.) If 
we can establish a functional relationship between s and ¢, we can calculate 
ds/d@ and have a theoretical prediction for the number of particles expected 
per second per unit solid angle per target nucleus. 

The force law between the alpha particles and the gold nucleus is an 
inverse-square law. In this case, the force is repulsive. The total energy is 
the kinetic energy of the incoming particle while it is still at great distance 
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of separation and must therefore be positive. The alpha particle follows a 
hyperbola. While there is a point of closest approach rmjn, there is NO Tmaz- 
The alpha particles approach from —oo and recede to oo. 

We have already seen the equation for the hyperbola, 


— (€?/mk) 
144/14 2@E)/(mk?) cos 4 


where @ is measured from the point of closest approach, rmin. If vo is the 
speed of the beam particles while they are far removed from the target, then 


B= xiv 
and, 
£=rxX mvo. 

The magnitude of the angular momentum is given by = mvgs. Since @ is 
measured from the point of closest approach, let the angle after scattering 
when the beam particle has moved far away from the target be a. Since the 
approach and the recession of the beam particle are symmetric with respect 
to the point of closest approach, we have 


2a+o=T, 
L.€., 
cos a@ = sin(¢/2). 


After scattering, r recedes to oo, becomes a, and the denominator of the 
right-hand-side of the expression for the hyperbola must approach zero. We 
then have, 

14+ 1/14 (4s?E?) /(k?) sin(¢/2) = 0 
which is our desired functional relationship between s and @. After differen- 
tiation and use of some trigonometric identities, we have, 


do ik 1 
dQ 16H? sin*(¢/2)’ 


This is the differential cross section per unit solid angle for Rutherford 
scattering. Observe that it is not purely a geometric area. The expression 
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depends on the energy of the beam particles. Beam particles of high energy 
are not scattered as much as beam particles of low energy. There is also a very 
sharp dependence on angle. For very small scattering angle (corresponding to 
minimal deflection) the cross section is very large, meaning that most beam 
particles pass through the target virtually undeflected. 

The total cross section is defined by, 


o= | do. 


In the present case we would compute this as, 


— [2 —dQ = fo oF on sin odd. 


For Rutherford scattering, the total cross section is infinite. Since the inverse 
square law is a long-range force, all beam particles are scattered to some 
degree and the target nucleus acts like it reaches out to infinite size. For 
short range forces, the total cross section is well defined. 

Total cross sections are used in a variety of ways. If a molecule moves 
through a gas of number density n, the mean free path A between collisions 
is related to a cross section, 

pean 
no 
and the collision frequency v is, 


PSo) 2] 0ne: 


If a beam passes through a gas of number density n which removes particles 
by some process characterized by a cross section, the attenuation as a function 
of the distance through the gas x is given by, 


l= Iga: 2°, 


8.3. Problems 


1. A particle in a central force field moves in a spiral orbit r = c6?. 
Determine the form of the force function as a function of r and how 
the angle varies with time. (Partial answer: 


6x t/5 
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The problem is really two separate problems. The first part is intended 
to illustrate the general method by which the differential equation that 
relates r and t is transformed to one relating r and 6 using a constant of 
the motion and the substitution r = 1/zx. This is necessary in this case 
because you are given a relationship between r and @ from which you are 
to infer a form for the central force. The second part can be worked 
directly from the constant of the motion identified from the missing 
coordinate in the Lagrangian. After completing the problem, review 
the solution and identify techniques that have wider applicability than 
this particular example. 


. Show that for a circular orbit, 


2 C 
Ue 


> MPmin 
If vp is the speed of a particle at 6 = 0, show that one may write the 
orbit equation for inverse square law motion as, 


pS (v9 / Ve)? 
™"1 + [(v9/v-)2 — 1] cos 

The purpose of the problem is to demonstrate how the orbit equation, 
p/r = 1+ ecos9, can be transformed to another useful form. The force 
here is assumed to be of the inverse square type, F = —C/r?. 


. A moon rocket is initially travelling in a circular orbit near the earth. 
It is desired to place the rocket into a new orbit for which the apogee 
is equal to the radius of the moon’s orbit around the earth (240,000 
miles). Calculate the ratio of the new speed to the speed in the circular 
orbit needed to accomplish this. Assume that the radius of the original 
circular orbit is 4000 miles. Draw a sketch to illustrate the maneuver. 
Calculate the new apogee if the speed ratio is 0.99 of the value calcu- 
lated above. The result should illustrate the extreme accuracy needed 
to achieve a circumlunar orbit. (Note: This problem is meant to be an 
application of the result of the previous problem.) 


. A comet is observed to have a speed vp when it is a distance rp from 
the sun (of mass M) and when its direction of motion makes an angle 
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@ with the radius vector from the sun. Show that the eccentricity of 
the orbit can be written: 


IGM \ r2v2 sin* d 
exit (8) Sona 


5. The earth’s orbit is nearly circular e=0.017, aphelion=95,000,000 miles, 
perihelion = 91,000,000 miles). Assuming the earth’s orbit to be cir- 
cular, show that 


2 2, ,2 1/2 
ES ( + es — =e Ot in?) ; 
U To eve 

If a comet is first observed to be at a distance of 1/3 astronomical unit 
(1 a.u.=93,000,000 miles) from the sun and is observed to be travel- 
ing at a speed of twice the earth’s speed, is the comet’s orbit elliptic, 
parabolic or hyperbolic? (Grant R. Fowles, Analytical Mechanics, Holt 
Rinehart Winston, 1970.) 


6. A projectile is fired at an angle of 60 degrees above the horizontal and 
at a velocity of 5 x 10° cm/sec. Neglecting air resistance, what would 
be the range of the projectile measured on the earth’s surface. To what 
height above the earth’s surface does the projectile rise? Neglect any 
effect of the earth’s rotation. The purpose of the problem is to be an ex- 
ample of orbital equations applied to the motion of an intercontinental 
ballistic missile. 


7. Suppose that it is possible to measure the location of the center of mass 
and the period of rotation of a double star system. The orbits, both 
individually as well as relative to one another, may be considered to 
be approximately circular. r; and rg are measured from the center-of- 
mass and are also assumed known. Find formulas for the masses of the 
stars in terms of these data. The purpose of the problem is to illustrate 
an application of Kepler’s Third Law and center-of-mass coordinates. 


Partial answer: 5 


~ GP? 


my (ry +12)?ry. 
8. Computer Project 9 (See Appendix) 
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9. 
10. 


11. 


12. 


Computer Project 10 (See Appendix) 
Computer Project 8 (See Appendix) 


Hard spheres of radius 6 are elastically scattered by hard spheres of 
radius a. Find the relation between the scattering angle and the impact 
parameter in the center of mass coordinate system. Find the differential 
scattering cross section per unit solid angle, do/dQ, in the center-of- 
mass frame. Find the total cross section. The purpose of the problem 
is to illustrate the general defining relationship of the differential cross 
section, 


do _ s ds 

dQ ~~ sing dd’ 
The impact parameter is s and the scattering angle is ¢. An analysis 
of the geometry of the situation is used to relate s and @. 


e A particle of energy E crosses a boundary where the potential 
changes abruptly from 0 to —V,;. As a consequence, the direction of 
its motion changes from 0; (relative to the normal to the interface) 
to 65. If the “index of refraction” is defined as 


n = sin0,/ sin 62 


n= /14+(\/E). 


e Frequently used in nuclear physics is the “square well” to give an 
approximate description of the interaction between nucleons. It 
is defined by V(r) = —Vo for 0 < r < a and V(r) = 0 for r > 
0. Sketch the effective potential for such an interaction between 
two particles. Can bound orbits exist? Find do/dQ and do(¢) 
for scattering by such a potential. The results are conveniently 
expressed in terms of the index of refraction defined above. 


show that, 


Solution: Convince yourself that: 


(a) 
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a=, —02 = ¢/2 


s=asin6}. 


Here, s is the impact parameter. 
Then show, 
ce a*n? sin?(¢/2) 
1+ n? — 2ncos(¢/2) 
Eventually show, 
do _ a?n? — (ncos(¢/2) — 1)(n — cos(¢/2)) 
dQ = 4cos(¢/2) = (1 +n? — 2ncos(¢/2))? 


(T. C. Bradbury, Theoretical Mechanics, John Wiley & Sons, 
1968) 
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Chapter 9 


Systems of Interacting 
Particles: Rigid Bodies 


9.1 Definitions and Theorems 


9.1.1 Definition 


A rigid body is a system of many particles constrained in such a way that the 
particles making up the body are held fixed relative to one another. 


9.1.2 Definition 


With respect to an axis, the moment of inertia I of a mass distribution is 


defined, 
I=) (mar3) > [ ram. 


where rq is the perpendicular distance from a mass particle to the axis. 


9.1.3 Theorem: Law of Parallel Axes 


If a rigid body consists of a thin lamina, i.e., all the particles lie essentially 
in a plane, and if r’/, is the position vector of a particle relative to an axis 
perpendicular to the plane and through the center of mass and if R is a 
vector from an arbitrary axis (also perpendicular to the plane) to the center 


of mass, then 
I = Tem + MR’, 
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where J is the moment of inertia about the arbitrary axis, I. is the moment 
of inertia about the center of mass and M is the total mass of the system. 

Proof: Let two parallel axes be defined which are perpendicular to the 
lamina and thus parallel to one another. Let O be the intersection point of 
the first axis with the lamina and let rg mark positions relative to this axis. 
The second axis passes through the center-of-mass at O’. R is the position 
of O' relative to O. We then have rz, = R+r/,. Hence, 


T= Yi mara = > Malt, + R) + (ry + R) 
= So mare + 2R-S > marl, + Re mg 
But, if O’ is the center-of-mass, > mar’, = 0. Hence, 
I = Ion + MR’. 


QED. 


9.1.4 ‘Theorem: Law of Perpendicular Axes 


The moment of inertia of any thin lamina about an axis normal to the lam- 
ina is equal to the sum of the moments of inertia about any two mutually 
perpendicular axes passing through the given axis and lying in the plane of 
the-lamina, fy := 1. 4Pd5). 

Proof: Let the x and y axes lie in the lamina with the z axis perpendicular 
to the lamina. Then, 


I, = Yo mer = >) max, +: 93) = Ie + Ty. 


QED. 


9.1.5 Moments of Inertia of Uniform Objects 
1. Thin rod of length a 


e Normal to the rod at its center: 4ma 


e Normal to the rod at one end : 3ma 
2. Thin rectangular lamina with sides a and b 


e Through the center, parallel to b: sma? 
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e Through the center, perpendicular to the lamina: 4m/(a? + b*) 


. Thin circular disk of radius a 


e Through the center in the plane of the disk: +ma* 


e Through the center normal to the plane of the disk: 4ma 
. Thin hoop or annulus of radius a 


e Through the center in the plane of the disk: sma? 


e Through the center normal to the plane of the disk: ma 
. Thin cylindrical shell or radius a and length b 
e Central longitudinal axis: ma? 


. Solid right circular cylinder or radius a and length 6 


e Central longitudinal axis: $ma? 


e Through the center, normal to the central axis: m(S + o) 


. Thin spherical shell of radius a 


e About any diameter: ma? 


. Solid sphere 
e About any diameter: 2ma? 


. Solid rectangular parallelepiped with sides a, b, and c 


e Through the center normal to the plane of a and 6 and parallel to 
were >. 
C: salar + b*) 
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9.2 Motion of Laminar Bodies about a Fixed 
Axis 


9.2.1 Theorem 


If a rigid body consists of a thin lamina and if the body rotates about a fixed 
axis which is normal to the plane of the body, then the governing equations 
of motion are: 
£= J0n, 
N = — = /6n, 
dt 


and, 
Des 
(ie be 
2 


Proof: Let n be along the axis of rotation, i.e. perpendicular to the 
lamina with O located at the point of intersection of axis and lamina. Then, 


£= yrs EV: 


Because the origin is at the axis of rotation, rg is perpendicular to v, so 


that, 
£=n ae PAs =D S- Molo(To) = (0) S- Mal 
= In. 
QED. 


Then, since J and n are constant in time, we may differentiate with 
respect to time to obtain, 


de 


=— = /6n. 
dt 
QED. 
Finally, 
1 1 . 1. 1 
ie S- Mara = ‘2 5 Ma(ra)” = 50 S- Mala 
= £162, 
2 
QED. 
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9.3. Laminar Motion 


9.3.1 Definition 


If a rigid body is constrained to move parallel to a plane (such as a wheel 
rolling on a flat surface or a block sliding down an inclined plane), the motion 
is said to be laminar motion. 


9.3.2 Theorem 
The governing equations of laminar motion are, 
F = Magn, 
Noa = Tomb, 

and, 

1 

2 
Proof: We have already proven for any system of particles: F = Ma,,,, 

i.e. that the system of total mass M taken as a whole obeys a form of 

Newton’s Second Law for the motion of the center-of-mass. F is the net 


force on the system. If P is taken as an arbitrary fixed point, we have also 
shown that relative to this point, 


1 ; 
T= 5M Vem ey Er ae 


ae 

These two results apply to the more specific case of laminar motion as well. 
It is also true generally as well as in the specific case of laminar motion, 
that the angular momentum and kinetic energy can be decomposed into two 


terms, 
Lp = Rem X MVem + ety X MaVi,) = Lom +L 


i 1 
(e— 5 MV ern +5° ava 


Thus, the latter two equations each decompose into two parts which describe 
a “particle” of total mass, M, located at the center-of-mass and which de- 
scribe motion of particles relative to the center-of-mass. We may use the 
decomposition of the angular momentum to show, 

d€p dR. AVem _ dlem 


xX MVem + Rem X 


Noe ey 
pe at dt dt dt 
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dL om 
dt — 
The first term on the right vanishes. Using F = Ma,,,, we have, 


_ (Ghee 
dt 


We see once again the decomposition into a term representing the center-of- 
mass “particle” and action relative to the center-of-mass. 

Therefore, we may divide our problem into two parts. One part is the 
motion of the center of mass, which obeys the equation, F = Ma,,,. We 
may then solve, as a second part, the problem of motion relative to an axis 
through the center-of-mass perpendicular to the lamina and thus reduce it 
to the problem of laminar motion relative to a fixed axis. The unit vector n 
lies along the axis. Thus, the governing equations for the system are, 


= Vem X MV em + Rem X Maem + 


Np + Ron X F = Nen + Rem x F. 


F = Mam, 
Nii = TomOM, 
and, 
T = SMV? 4+ SDem(? 
<8 9 cm 9 cm : 
QED. 


9.4 Rotational Motion with respect to a 
Fixed Point or the Center of Mass 


9.4.1 Definition 


An inertial set of axes relative to which a rigid body moves is called a space 
set of axes. A set of axes fixed to a rigid body is called a body set of azes. 


9.4.2 Euler’s Equations 


Consider a body set of axes fixed to a rigid body at the center of mass. The 
governing equations are then, 


£=R, x MV.+ Sor, X Mavi, 


and, 4 

N= TE” 
Since the translational motion of the center of mass decouples from the mo- 
tion relative to the center of mass, we may focus on the rotational motion 
relative to the center of mass taking the motion of the center of mass itself to 
be a separate problem. The problem of the rotation relative to the center of 
mass is completely equivalent to motion of a rigid body with one constrained 
fixed point (not necessarily a fixed axis), although in the latter case the point 
need not be the center of mass. In either case, the general result, 


vevt+wxr'4+Vo, 


reduces to, 
—— ! 
Vv SOK T,. 


Then, relative to the center of mass or to the single fixed point, 


Le ig RV = ig RT, |e 


We may reduce the triple cross product to yield, 
£= Si mgalwre — ro (ra: w)] 


= De malwiei)ra — (Cas) (Ca js)'] 


= oD Mea lOgha = UU gj lWG; 


= TI lal 
= [, 8; 


=I-w. 


The matrix J;; has moments of inertia [,,[4, and I, on the diagonal. These 
are the moments of inertia about the three body axes respectively. Because 
the body set of axes is fixed to the object, the moments of inertia are con- 
stants, i.e., they do not change with time. The off-diagonal elements of J 
are called the products of inertia. They also are constants in the body set 
of axes. If the body axes are chosen carefully, the products of inertia can be 
made to vanish, thus simplifying most problems. 

A body set of axes in which the products of inertia vanish is called the 
principal axes and the corresponding moments of inertia are the principal 
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moments of inertia. If you consider a typical moment of inertia tensor with 
off-diagonal elements as a matrix, you can find the principle moments of in- 
ertia by finding the eigenvalues of the matrix. The eigenvalues of a matrix 
then become the diagonal elements of its diagonal form (the principal mo- 
ments) and the corresponding eigenvectors are the along the principal axes. 
However, in most simple cases, one can find the principal axes by inspec- 
tion using symmetry arguments. For example, you may be able to exploit 
symmetry to make the product of inertia, 


on = SS atete 


vanish if an axis can be oriented such that for every x-value, there is both a 
positive and a negative y-value for equal masses so that in the sum a positive 
term cancels a negative term. For example, for a square plate, the principal 
axes could be located at the center of the plate with x and y axes parallel to 
the sides of the plate and the z axis perpendicular to the plate. For a circular 
disk with origin at the center, any two perpendicular diagonals would serve 
as principal axes, with a third perpendicular to the plane of the disk. 
If we take the time derivative of 2£, 
de dé’, 


— at The hal / 4 


at ORE ese Pas <b ae ek 
= [,,Wje; + Ti 0%; (Oinr@,w) 


— hos} ‘yl '\al 


Thus, we have Euler’s Equations, 


| ES Bf ‘yl / 


If the axis set is a principal set of axes, the Euler equations have the 
simpler (and more common) form: 


, Poss / / / / 
N, = Lyw, + Ws (133 — Ih), 


a Poss / / / / 
Ng = Ip 9W + Ww W3 (Thy _ T33), 


i | ZA f / / / / 
N3 = 133W3 + WW (Too — [h1). 


slg 


9.4.3 Theorem 


1 1 1 


Proof: If we exclude any translational kinetic energy and thus restrict 
ourselves to the rotational kinetic energy relative to a fixed point or to the 
center-of-mass, we have, 


1 1 
= 5 Made = 5 Mal xr): (wx ri) 
1 Pat ry 
= 9 So Mel Sign; 2p) (SitmW1X om) 
1 Peobiade 3 
= 5 S- Mal(djdkm = OjmO1k )W {WX oh L om 


1 
= Suhel Doma dur? — ah) 


1 
QED. 
9.4.4 Theorem 
dT 
Bee NAT 
dt jj 


Proof: Multiply Euler’s equations by w/ to form the sum, 
Nya = [i juin, + wi diagno, Liu}. 


Since, the double sum Wijk = 0) (because even and odd permutations of 
dijk have opposite signs), we have, 


QED. Thus, the rate at which the net torque does work on the rigid body is 
equal to its time rate of change of kinetic energy. 
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9.4.5 Euler Angles 


The solution to Euler’s equations yields angular velocity, not position. To 
specify the arbitrary position of a rigid body (ignoring translational motion) 
requires three generalized coordinates since the arbitrary orientation in space 
has three degrees of freedom. If we freeze the body axes to the rigid body, 
we can equivalently specify the orientation of the body axes relative to the 
space axes. A common way to do this is in terms of three angles (called Euler 
angles) defined in the following way. 

Imagine a space set of axes and a body set of axes. Imagine originally 
that the two are identical, with the x3 and 24 axes coincidental and oriented 
vertically. Imagine the x, and x, axes coming out of the page and the x2 
and xi, axes lying horizontally to the right in the page to complete the triad. 
Now imagine rotating the primed axis set around the common 23, 75 axis 
through an angle ¢. This moves the «| axis toward the right and the x}, axis 
back into the page. The angle ¢ is the first of three simple rotations which 
define the Euler angles. If the angle ¢ increased in a time-dependent way, 
the corresponding angular velocity would be ¢é3. 

The new 2 axis is called the “line of nodes.” Define a unit vector, n, 
along the line of nodes. Now, imagine that the line of nodes is held fixed and 
that the primed axis system is rotated through an angle 6 about it. This 
rotation separates the x3 and x3 axes by the angle 6. The angle 6 is the 
second of the Euler angles and a corresponding angular velocity about the 
line of nodes would be 6n. 

Finally, hold the new x axis fixed and rotate the primed system through 
an angle w about the x3 axis. This is the third of the Euler angles and the 
corresponding angular velocity is Wes. 

The three angles @, 6, ~ are the Euler angles and they specify an arbitrary 
orientation of the body (primed) set of axes relative to the space set of axes. 
An arbitrary angular velocity is given by, 


w = G63 + On + Yes, 
where, 
nh = cos we}, — sin wes. 


The utilitarian value of the Euler angles is greatly increased if we can 
relate the unit vectors in the two sets of axes (body and space) to one another 
via the Euler angles. This is done in Table 9.1. 
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Table 9.1: Unit vectors of the body axes as combinations of unit vectors 
of the space axes (read horizontally) or unit vectors of the space axes as 
combinations of unit vectors of the body axes (read vertically). 


e; €y 63 
é, cosdcos~¢ — cos @sin dsin w sin @cosw + cos@cosdcosw sind sin w 
e, —cosdsiny —cos@singcosw —sindsinw +cosAcos¢cosw sinécosw 
e, sin sin @ — sin cos @ cos 6 


For example, using Table 9.1, 
w = ¢é63 + On + ve, 
= ¢(sin é sin é), + sin 0 cos We, + cos 05) + O(cos wel, — sin és) + Pes. 
By collecting the coefficients of the unit vectors, we then have, 
wi = dsinO sin w + Ocos x), 
w!, = dsin 6 cosy — Osiny, 
ws, =v + dcosd. 


These latter expressions are useful for expressing the kinetic energy in terms 
of Euler angles and their time derivatives via, 


1 /7yl / 
i iiss: 
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9.5 Problems 


ie 


A sphere of radius a rolls off a hemisphere of radius b placed convex- 
side-up on a table. Does the sphere leave the hemisphere before it hits 
the table? At what angle? How does this compare with a particle that 
slides from a frictionless hemisphere? 


Find the inertia tensor for a square plate of side a and mass m in a 
coordinate system O'z'y'z’ centered at one corner with the x’ and y’ 
axes along the two edges. Find the angular momentum about O’ of 
the above plate when it is rotating with angular frequency w about the 
diagonal of the plate that passes through O’. Find the kinetic energy 


of the above rotating plate. 
Partial Ans: 


For the plate above, find the principal moments of inertia, i.e. moments 
of inertia in the axis system in which the products of inertia vanish. 
Find the directions of the principal axes. Principal axes are axes chosen 
so that the products of inertia (the off-diagonal terms in the inertial 
tensor) vanish. 


Find the moments and products of inertia of a uniform rectangular 
block of sides a, b, c for a coordinate system with origin at one corner 
and with axes along the edges of the block. If the block spins about a 
long diagonal through O’, find the angular momentum about the fixed 
origin. Does your answer reduce to the result above for a thin plate? 


Consider a thin square plate with one corner fixed (at which we center 
body axes) and rotating about the z’ axis (oriented along one side) with 
constant angular speed w. Find the torque necessary to maintain the 
angular velocity along this axis. If you orient the x’ axis along the 
diagonal, what torque is now needed to maintain the angular velocity 
along this new direction? What obvious advantage is there to orienting 
the angular velocity along the principal axes of the plate? 


eal 


6. Find the solution to Euler’s Equations in the absence of torque for 
principal axes in which Ij; = [92 = J. Assume I < [33 and w3 > 0. You 
might try defining 

Ys 7 W3 
and using complex numbers to uncouple the equations. 


7. Find the solutions of Euler’s equations for principal axes when J,,; = 
Ig > I33 and first component of torque Nj as measured in the body 
coordinates is not zero but is constant. Assume w3 > 0. (Note: Observe 
that in this problem Ig2 > [33 but that it is just opposite in the previous 
problem.) 

Ans: 
ee sie 133 
In 
w3 = constant 


3 


w, = acos y(t — to) 
Ny 
w3(Ii1 — I33) 


8. An artificial satellite which is approximately cylindrical in shape has 
principal moments of inertia [;; = Ig = 10*slug ft? and Iz; = 0.4 x 
10‘slug ft?. Initially, the satellite is not rotating. There are two rockets 
mounted directly opposite one another parallel and antiparallel to the 
x, body axis such that they cause a torque in the positive x3-direction. 
These rockets have a thrust of 10 lb and are 4 ft from the center of 
mass. The are fired for 100 sec and then shut off. Now rockets, which 
are at opposite ends of the satellite mounted parallel and antiparallel 
to the body x-axis such that they produce a torque in the positive 71- 
direction, are fired for 2.62 sec. These rockets each produce 300 Ib of 
thrust and are 10 ft from the center of mass. Assume that the amount 
of material ejected by the rockets is small enough so that the moments 
of inertia are not affected. Find the components of angular velocity in 
the body set of axes for the final motion. (T. C. Bradbury, Theoretical 
Mechanics, John Wiley & Sons, 1968). 


Solution: This is a straightforward application of Euler’s equations 
and is meant to illustrate the results of the two previous problems. 


W2 = —asin y(t — to) — 
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The initial condition is w, = w2 = 0 at t = 0 and should allow you to 
evaluate arbitrary constants. At the time all motors are shut off you 
should be able to show that w; = 0, w2 = —1.0/sec, and w3 = 2.0/sec. 
By reestablishing your time origin at the time the motors shut off, you 
should be able to show that thereafter, 


WwW, = —sinyt 


W2 = —cosyt 


w3 = 2.0/sec. 


9. For the previous problem, what is the angle between w and ¢? What 
is the angle between w and the body x-axis? At what angular rate 
does w precess about the body 5-axis? At what angular rate does w 
precess about @. 


Solution: 


e Draw carefully a diagram and read most of the answers from it. 


e Since @ is a constant of the motion, it will have a constant mag- 
nitude and a fixed direction in space. It might be helpful to draw 
your diagram at the time when the precessing w has a zero com- 
ponent along the x-axis and only w. and w3 are non-zero in the 
body axes. 


e To find the angular rate of precession of w about ¢, whisper softly 
to yourself the magic words: “The polhode cone rolls on the her- 
polhode cone without slipping.” With @ and w sharing a common 
origin, the herpolhode (or space) cone is the cone that w traces 
out as it precesses about ¢ where @ is fixed in space. The polhode 
(or body) cone is the cone that w traces out as it precesses about 
the symmetry axis (x3) of the object (body). Can you visualize 
the motion of the object being represented by the rolling of the 
body cone on the space cone without slipping? (Ans: 1.28/sec). 


10. Experimental: Go to the walk-in laboratory area designated by your 
instructor and find the air-suspension top apparatus. Do the following 
and write a simple report about what you did. 


e Demonstrate the effects of precession, retrograde motion, nutation 
and the so-called “sleeping top.” 
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e Find the torque on the ball when the rod is attached, measure the 
precession frequency, and calculate the rapid (and therefore hard- 
to-measure) rotation frequency of the ball. Make sure the ball is 
rotating really fast. Minimize the nutation so that the motion is 
pure spin and precession. 


11. Computer Project 12 (see Appendix) 
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Chapter 10 


The Special Theory of 
Relativity 


The Special Theory of Relativity can seem to be a fantastic, almost unbe- 
lievable, subject. In introductory courses, it is often illustrated with thought 
experiments involving rocket ships or trains moving near the speed of light. 
The disclaimer is usually made that the effects of relativity are very, very 
small in ordinary experience, thus giving the impression of an esoteric sub- 
ject of little practical consequence. 

But for many areas of physics, Special Relativity is very important and 
very practical. Relativistic effects are important in nuclear physics, elemen- 
tary particle physics, high-temperature plasma physics, beam physics, accel- 
erator technology, astrophysics, space physics, cosmic-ray physics and even 
sometimes in high-precision atomic physics. But even if it were not so, it 
would still be true that the education of a physicist is not complete until he 
or she understands this great revolutionary subject of the twentieth century. 
The four-vector formulation of Special Relativity can bring one to a deeper 
understanding and reveal an elegance and structure the one may otherwise 
not recognize. 


10.1 Galilean Transformation 
In developing the Newtonian mechanics of mass particles, we have made 


some assumptions. To be sure, we are reassured by experiment that these 
assumptions are true within the realm where the experiments have actually 
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been performed. For example, we have taken the masses of the particles to 
be constants that are independent of the motion of the particle. The mass 
is a measure of inertia, i.e. a measure of the resistance to acceleration by a 
given force. Chemistry of the nineteenth century concluded that mass was 
a strictly conserved quantity and that any motion of the particles did not 
change this conclusion. 

We have also made some assumptions about time and space. Consider 
a coordinate system with origin O that is fixed in space (the “laboratory 
frame”) and consider another frame with origin O’ moving with constant 
velocity v along the x-axis of the laboratory frame. For simplicity assume 
that the z’-axis is parallel to the x-axis. Assume that a clock hangs on the 
wall of the laboratory and that observers at rest in the laboratory or moving 
with the primed frame can read time intervals from this clock. This is a 
second assumption on our part, i.e. that the single clock on the wall of the 
laboratory serves stationary or moving observers equally as well. Assume 
that there is a particle of mass m that has position r’ in O’ such that its 
position in O is r = r’ + vt. The particle is not necessarily at rest in O’, 
but we are assuming that if it moves, it moves along the common x and 2’ 
direction. 

Now, if the particle moves in some small time interval dt that is read from 
the clock on the wall, we have, 


dx = dx’ + vdt 
dy = dy/ 
dz = dz’ 


and, since time is read from the single clock on the wall, 
d= de. 


We call this connection between the unprimed and primed coordinates 
of the particle a Galilean transformation. If we divide the first equation 
by dt, we get a familiar relationship between the speeds, u, = dx/dt and 
Uy = dx'/dt', along the x-axis, 


Ug = Ug + UV. 


When we search for the laws of physics, we seek generalities that serve the 
observer in the laboratory as well as the observer who moves with the primed 
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frame. Our own experience, perhaps in an airliner moving in a straight line 
at constant speed, tells us that uniform motion is not observable. By this 
we mean that we would observe no violation of the laws of motion that we 
deduced from experiments on the ground when applied to experiments in the 
airliner. This is very desirable, since one would not want to dignify with the 
label “law” a relationship that is only valid in a particular frame of reference. 

If we differentiate our velocity formula, remembering that v is constant, 


Ay, = Ay’. 
For Newton’s Second Law, this means that if F = F’, then 
F=ma 


and, 
F’ = ma’. 


Thus, Newton’s Second Law takes the same form in each of the two frames 
of reference, and this is made possible, in part, by the assumption that the 
Galilean transformation is correct as well as the assumption that the forces 
themselves are equal as seen from the two frames. 

To illustrate, imagine a platform on a flatcar on a railroad track that 
is moving at constant speed on a straight section of track. If a cannon 
is mounted to the platform in such a way that it fires a projectile exactly 
vertically, the projectile shares the horizontal speed of the flatcar when it 
is fired. This horizontal speed of both the platform and the projectile is 
unchanged (in the absence of air currents) during the motion of the projectile. 
Hence the projectile falls back on the platform exactly where it was first 
fired. Newton’s Second Law applied to the motion, either from the frame 
of reference of the platform or from the stationary laboratory frame at the 
side of the track, predicts this same outcome. The same law is applicable 
and works in both frames to describe the projectiles’s motion, although the 
motions will appear different to observers in the two frames. 

But, consider what happens if our particle is replaced by a photon. If 
the photon moves along the x-axis, the Galilean transformation tells us that 
the speed of the photon should be the sum of its speed in the primed system 
plus the speed of the primed system relative to the unprimed system. It 
is not so. In a paper published in 1965, researchers at CERN in Geneva 
produced two photons from the decay of a neutral pion. The pion (which 
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here defines the origin of the primed frame) was moving at 0.999 times the 
speed of light. The time of flight of these photons was measured over a 
distance of 80 meters and to within experimental error, the speed of both 
(moving in opposite directions along the x axis) was equal to the speed of 
light in vacuum. The motion of the pion made no difference in the speeds of 
the photons in the laboratory to within a precision of about 130 parts per 
million. This is a very remarkable result and it means that for photons, at 
least, something is very wrong with the Galilean transformation, something 
that was first suspected in 1887 when the Michelson-Morley experiment was 
first performed. 

If you try to fix the Galilean transformation with the simplest linear 
generalization of the Galilean transformation, one might try, 


dx’ = adx + Gdt 


dy’ = dy 
dz’ = dz 


and, 
dt’ = ydt + ddz. 


The coefficients a, 3, y,d would presumedly be chosen in such as way as to 
somehow account for the unexpected outcome of the pion experiment. In 
doing so, the Galilean transformation would be replaced by a more general 
one, the relationship between velocities would also be changed, and quite 
possibly, we would conclude that Newton’s Second Law itself as we have 
known it would have to be replaced with another equation in order to achieve 
form invariance (so-called covariance). Thus, the consequence of the pion 
experiment is far-reaching. 


10.2 Spacetime and the Lorentz Transforma- 
tion 

One of the consequences of our proposed new transformation is that our 

observers can no longer share the common clock on the wall. You can see 


this particularly in the fourth equation of the set where it is proposed to 
make dt’ equal to a linear combination of dt and dx. So, from henceforth, 
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we will provide observers with their own rulers and clocks which they carry 
with themselves to make measurements of space and time. 

Because of this mixing of space and time, it is helpful to relax the dis- 
tinction between space and time that is natural to us by beginning to think 
of a four-dimensional world. The “points” in this world are called events and 
have four coordinates. Three of these are the regular (x,y, z) coordinates 
of position relative to some suitably chosen origin and the fourth is a time 
coordinate measured by a clock that belongs to the frame of reference from 
which the event is observed. The events are “things that happen” at a cer- 
tain place and at a certain time. Each event can be specified uniquely by a 
quartet of coordinates, (x,y, z,t) or (x’,y’, 2’,t’) depending on the frame of 
reference and origin chosen. 

Separations between events can also be specified by a _ quartet, 
(dx,dy,dz,dt) and it is these numbers that are related by the new linear 
transformation that we are seeking. As long as the separations are very 
small, we can at least assume that the transformation is approximated by 
a linear transformation and it is for this reason that we concentrate on the 
transformation for the separations. If we think of the coordinates in the 
primed system as being some functions of the coordinates in the unprimed 
system, 

av, = j(x;,t) 
U= t (a2); 


then, for small displacements and a reasonably behaved transformation, the 
chain rule of differentiation yields, 


Or’ Ox! 
dread: “dt 
x, Du, Xi DE 

Ot Ot! 

/ ie * ———. 
dt’ = eal “+ oy dt. 


This is exactly the form of the transformation we seek. By “reasonably 
behaved transformation,” we mean one that is differentiable and has a unique 
inverse. 

Now think of the experiment that we used to illustrate Galilean invari- 
ance, i.e. the cannon fired vertically from a horizontally moving platform. 
Only this time we will use a photon that reflects from a mirror that is over- 
head and moving with the platform. The primed frame is attached to the 
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platform which we take to be moving from left to right along the direction 
chosen for the x and x’ axes. In the primed frame, the platform is at rest 
and the photon moves vertically, strikes the mirror at a distance dé away, 
and returns to its point of origin along the same line as the line of its ascent. 
The line is a series of events which is called a world line. 

We want to focus on two of the events of this world line of the photon. 
Event 1 is the event of the photon being produced. Event 2 is the event of 
the photon striking the platform after being reflected from the mirror. These 
two events occur at exactly the same place in the primed system, but the 
time between them is the time it takes for the photon to travel a distance 
2dé at the speed of light, c. Thus, in the primed system, 


dx’ = 0 
dy’ =0 
dz’ = 0 
a 

aoe 


Viewed in the unprimed (laboratory) frame, the photon moves upward 
and to the right along a straight line, strikes the mirror, then moves downward 
and to the right before hitting the moving platform. In the unprimed system, 
the platform has moved a distance vdt. Thus, in the unprimed (laboratory) 
system, 

dx = vdt 


dy = 0 
dz =0 


vat \2 
ae Goede) . 
c 
The expression for dt is a straightforward application of Pythagoras’ theorem 
for right triangles. 

Exercise: Draw a diagram showing Event 1 and Event 2 and show that 
the above expressions for dx, dx’, dt, dt’ are correct. 

By assuming that the photon has the same speed in both frames of refer- 
ence, we are forcing the transformation to describe a world in which the speed 
of light does not depend on the motion of its source, i.e., the experimental 
outcome of the pion experiment. 
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If we substitute these relationships into our proposed transformation, we 
have, 
0 = adx + Gdt 


which implies, since dx = vdt, that 
av+ 8 =0. 


The fact that dt’ = 2a can be combined with, 


2,/(2t)? + (de)? 
ra ae ca 


2: 
dt! = 4/1 — dt. 
Cc 


Finally, we can combine this last result with dt’ = ydt + ddz, to show that, 


| 2 
y¥+ou= pa 
C 


Unfortunately, this gives us just two equations for the four unknown co- 
efficients, a, 3,y, 0. So let’s consider a second experiment to get another set 
of two equations. This time, hold the mirror stationary and let the pho- 
ton move exactly vertically up and back in the unprimed system so that for 
Events 3 and 4 (which are not to be confused with Events 1 and 2 in the 
quite different first experiment), 


to show that, 


dx = 0 

dy = 0 

dz = 0 
2dé 
dt = — 

Cc 

and, 
dx’ = —vdt' 
dy’ =0 
dz’ = 0 


2 /(ete)? + (de)? 


2 
Cc 


dt' = 


The minus sign in the expression for dz’ arises because the platform moves 
from left to right. The reflected photon strikes a point behind the point of 
origin in the primed system. 

Exercise: Draw a diagram showing Event 3 and Event 4 and show that 
the above expressions for dx, dx’, dt, dt’ are correct. 

Because dx = 0, the transformation is simplified and we use the dz’ and 
dt’ equations to show, respectively, 


i 
is 
1 
oa = 


—( 1 
Qa = —_— = ae 
U Lee 
= ee. at 


U C2 ee 
c 


We may now write down our transformation as 
dx’ = y(dx — vdt) 


dy’ =0 
dz =0 
a vdx 
We call this transformation the special or restricted Lorentz transformation. 
It is “restricted” because we have constrained all of our motions to be along 
the x, x’ axes. Later we will relax this restriction and derive the full Lorentz 
transformation in three dimensions. 
The special Lorentz transformation can be inverted simply by taking the 
view of an observer in the primed system. From her perspective, the primed 
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system is at rest and the unprimed system moves from right to left with 
speed —v. Thus, 
dx = y(dx’ + vdt’) 


Inertial frames are frames in which there is no physical consequence of as- 
suming that the frame is at rest or in uniform motion and this assumption 
is now built into our Lorentz transformation. Observe also that in the limit 
as u/c — 0, we have y — 1 and the Lorentz transformation becomes the 
Galilean transformation. This is as it should be because the Galilean trans- 
formation serves very well for physics when speeds are much less than the 
speed of light. The speeds in the pion experiment were all at or near the 
speed of light. We have also assumed throughout that the speed of light is 
the same for observers in either the primed or unprimed system. The Lorentz 
transformation thus embodies two essential postulates which can be thought 
of as the foundation of the Special Theory of Relativity, 


1. The Postulate of Special Relativity: Every “law of physics” that 
holds in any reference frame must equally well hold in any reference 
frame that moves in a straight line at constant speed relative to the 
first. 


2. Postulate of the Speed of Light: The speed of light in vacuum is 
the same for observers at rest or moving relative to one another in a 
straight line at constant speed. It is an absolute invariant. 


The Lorentz transformation has a singularity at v — c. But, what would 
be the consequences of velocities that exceed the speed of light? Specifically, 
we focus on the transfer of information at superluminary speeds. Imagine 
two events, A and B, such that A precedes B. We may think of A being a 
“cause” of B and that information generated at the event A must propagate 
to B before B can happen. For example, event A might be the creation 
of a muon and event B might be its decay. The muon itself carries the 
information from event A to event B. For these two events dt4p > 0. The 
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Lorentz transformation tells us that in a primed system moving relative to 
the first with speed v, 


vdz 4 
dt's5 = (dtap - —— 


Ni 


Since dx 4p = udtap, we may write, 


UU 


If there is no restriction on the magnitude of u at which the information 
is carried from event A to event B, ie. u > c, the sign of dt',, may be 
made negative. Then, in the primed frame, the effect precedes its cause. We 
therefore add a third postulate to insure causality, 


3. Postulate of Causality: Information is not propagated at greater 
than the speed of light. 


10.3 Consequences of the Lorentz Transfor- 
mation 


There are several consequences that immediately follow from the Lorentz 
transformation. 


1. Non-Simultaneity of Events: The first consequence has to do with 
what we mean when we say that two events happen “at the same time.” 
Consider a frame of reference in which two events occur. We make the 
following operational definition: Two events which generate a light sig- 
nal are said to be stmultaneous if an observer located midway between 
the spatial positions of the two events receives the light signals coin- 
cidentally. (An “operational definition” is one made in terms of an 
experiment. Thus experiment and not semantics can be invoked to 
decide whether the events are simultaneous. ) 


Now, if two events that occur at different places separated by dx in the 
unprimed (laboratory) frame are simultaneous, then dt = 0. Thus, the 
Lorentz transformation tells us, 


d. 
dt! = (dt — =) 5 Ue 2G, 


C2 
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Two events that are simultaneous in one frame are not simultaneous in 
a frame moving relative to the first with speed v. 


Think of an unprimed system at rest and a primed stystem moving 
from left to right. Think of two event at points A and B (to the 
right of A) that emit light signals just as two points A’ and B’ are 
lined up side-by-side with A and B. These events are such that an 
observer halfway between A and B at point C receives a light signal 
from each coincidentally. We have purposely created two events that 
are simultaneous in the unprimed system according to the operational 
definition. But the observer at C’ (halfway between A’ and B’ in the 
primed system and moving with it) moves forward after the light signals 
are emitted so that the light has less of a distance to travel to meet 
her. The speed of light is unaffected by the motion of the unprimed 
system and the signal from B’ arrives first followed by the signal from 
A’. The observer in the primed system judges that the events are not 
simultaneous, just as the Lorentz transformation predicted. 


. Lorentz Contraction: A second immediate consequence of the Lorentz 
transformation is called the Lorentz contraction. Let us imagine a 
separation dx’ that marks the length of a stick that lies along the 2’ 
axis and is at rest in the primed system. As the stick moves by, an 
observer in the unprimed system wants to measure the length of the 
stick with his own ruler. One way to accomplish this is for the unprimed 
observer to mark the positions of the two ends of the moving stick on 
his stationary ruler as the stick flies by. The markings of the two ends 
are two events which the unprimed observer must make simultaneous 
for his measurement to make sense. Certainly he does not want to mark 
the front end first, then allow some time to elapse before marking the 
position of the trailing end. So, in the unprimed system, dt = 0. The 
Lorentz transformation then tells us, 


dx’ = y(dx — vdt) = ydx 


The measured length, dz, is always shorter than dx’. This we call the 
Lorentz Contraction. The length, dx’, measured in the rest system of 
the stick (primed) is called the proper length of the stick. 
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To the primed observer, the Lorentz transformation yields, 

vu _ da! vdx' 
-E(S) =-“F 40. 
The primed observer judges that the primed observer’s measurement of 
the length of the stick is “incorrect” because, from the primed observer’s 
point of view, the unprimed observer did not mark the ends of the stick 
simultaneously. Indeed, the negative sign for dt’ means that from the 
primed point of view, the trailing end of the stick (A’) was measured 
after the forward edge (B’) so that tg — ta: < 0, ie. tar < tar. 


C? ey 


. Time Dilation: Finally, the Lorentz transformation gives us a result 
called time dilation. The tick of a clock is an event. If the clock is 
at rest in the primed system, two ticks of the clock occur at the same 
place and dz’ = 0. The Lorentz transformation then tells us, 


vdx' 

dt = y(dt’ + 2 dt. 
/ 
= dt 
1-4 


The time interval measured in the rest frame of the clock is called the 
proper time. The temporal interval dt is always greater than dt’ if the 
clock moves relative to the unprimed frame so that the moving clock is 
judged by the unprimed observer to be running slow. 


Time dilation is very commonly observed for radioactively unstable 
particles moving at high speeds. Muons are unstable and decay to 
an electron, a neutrino and an antineutrino. The proper lifetime of a 
muon is 2.2 microseconds. Muons moving near the speed of light have 
a dilated mean lifetime in the laboratory. For muons in cosmic-ray 
showers, the dilation factor, y, may be 10° or more. 


. Addition Law for Velocities: Having replaced the Galilean transfor- 
mation with a more general Lorentz transformation, we must ask what 
relationship replaces the Galilean rule for adding velocities, u/, = u,+v. 
Assume that the primed and unprimed frames are oriented so that the 
special Lorentz transformation holds. Then, 


dx y(de’ + ud!) = t+v 
Us a = caraway — ee 
d+ ee) lb ag 


C2 
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Thus, we have the new addition formulas for velocities, 


ul, + 
ce ea uv 
Ips 
and, similarly, 
1 ow 
a a v 
Neko re 
i es 
i ae uhv 
pe 


Observe that as u/, and v both approach the speed of light, wu, never- 
theless remains less than the speed of light. For ul, <<< cand vu <<, 
the formulas are essentially the Galilean formulas. 


Exercise: Demonstrate that the formulas for u, and uz are correct. 


Notice again the emphasis in each of these examples on the concept of 
an “event.” Once the Lorentz transformation is defined, the consequences 
of Special Relativity follow by systematically identifying events and their 
spatial and temporal separations. Notice also that one must carefully differ- 
entiate between what the primed and the unprimed observer measures for 
these separations. If one is not careful in making the distinctions between 
the viewpoints of the primed and unprimed frames, Special Relativity can 
sometimes seem paradoxical. Many of these apparent paradoxes can be re- 
solved by focussing on the spatial and temporal separation of events as was 
done above. 

Exercise: Consider the following. Perhaps you have seen a pole vaulter 
carrying a pole as he runs down the ramp to approach the crossbar. The 
pole is held horizonal. Imagine an unprimed (laboratory) observer watching 
such a pole vaulter approach a small shed equipped with garage doors, both 
open, at the front and back. The pole vaulter is approaching the front of 
the shed. Let us imagine that the proper length of the pole and of the shed 
is 10 meters, but that the vaulter can run very, very fast. To the unprimed 
observer, the moving pole is Lorentz contracted to half its length, but the 
shed is at rest and is not contracted. So, the five-meter contracted pole fits 
nicely inside the shed and the garage doors could be closed quickly, trapping 
the vaulter and pole inside. However, from the vaulter’s (primed) point of 
view, the pole is at rest and has its proper length of 10 meters. The shed 
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is moving relative to the vaulter and has a Lorentz-contracted length of 5 
meters. The 10-meter pole can never fit inside the 5-meter shed. How then 
can relativity be correct? What two events might you wish to focus on? Are 
these events simultaneous? How is the paradox resolved? 


10.4 Three-Vectors 


The results that we have derived thus far from the Lorentz transformation 
are, for the most part, treated in every introduction to Special Relativity in 
about the way that we have done here. We wish now to develop a very elegant 
and powerful formalism that will allow us to understand Special Relativity 
much more deeply and to derive additional important consequences of the 
theory. To see what we are doing and why, it is probably worthwhile to 
revisit some ideas and formalism associated with ordinary vectors. 

The concept of a vector is often first introduced to represent physical 
quantities that have a magnitude and a direction. We sometimes come to 
think of a vector as being defined by the characteristics of magnitude and 
direction. We often represent a vector by an arrow with the direction of 
the arrow giving the direction of the vector and the length of the arrow 
scaled to represent the magnitude. We will now define a vector in a more 
general and abstract way, which, nevertheless incorporates the characteristics 
of magnitude and direction. 

On a piece of paper, draw a vector a which reaches upward and to the 
right. After drawing a, draw an x-axis horizontally and an 22-axis vertically. 
Observe that the vector existed independently before the coordinate system 
was drawn, but once you draw the axes, the vector acquires components 
in the coordinate system. In a three-dimensional coordinate system, these 
might be represented as (a1, d2, a3). 

Now add a second coordinate system. Imagine x originally along x, and 
x, along x. Then rotate the primed system through an angle 6 counterclock- 
wise around a common 23, 75-axis imagined extending perpendicularly out of 
the page. The vector a has components in the primed system, (a, a5, a4) as 
well, but they are not the same as the components in the unprimed system. 
Indeed, 

a’, = a, cos + agsin 6 


ay = —a,sin@ + ag cosé 
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i, = Gs: 

Since this is a very special rotation, we shall refer to it as a limited or special 
rotation. 

Exercise: Draw a diagram and derive the special rotation transformation 
above. 

The triplet of numbers that represents the vector depends on your choice 
of orientation of the coordinate system. We might write the relationship 
among the components in matrix form, 


a cos@ sin@ 0 ay 
ay |= | —sin@ cosé 0 ag 
a 0 0 1 a3 


The matrix, 
cosd sind 0 
R= | —siné cosé 0 
0 0 1 


is called the rotation transformation matrix for the special rotation. It con- 
nects the unprimed and primed components of a vector when the coordinate 
systems is rotated in the special way described above. In a completely gen- 
eral rotation of the primed systemstem relative to the unprimed system, we 
can represent the matrix by, 


Ry Ry Ry 
R= Roy Ro Ro3 
Ry R32 R33 


We can also represent the transformation in the much more economical form, 
3 
/ = 
a; = S- RG; — Aya: 
j=l 


In what follows, a repeated index, such as 7 in this example, will imply a sum 
on that index. Once the sum has been carried out, 7 disappears altogether 
from the expression and is called a dummy index. The index could, of course, 
be replaced by any other symbol, i.e., Rija; = Riray. The matrix R for a 
general rotation has a number of important generic characteristics which we 
can only summarize here: 
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1. The transformation R is an orthogonal transformation. This means 
that the transpose of the matrix R is the inverse of R, i.e., 


Be ae 
R= RR =, 
These inverse relations may also be expressed, 
Rij Rin = Ojx 
Ry Rei = 05x; 
where 0, is the Kronecker delta function. 


2. The determinant of R = +1. This feature is characteristic of a proper 
orthogonal transformation and includes rotations. If a single coordi- 
nate axis is inverted, the determinant is —1 and the transformation, 
although orthogonal, is said to be improper. 


A scalar under rotations is a single function of position in space. Its value 
at a particular point in space is independent of the coordinate systems chosen, 
although the value of the scalar may be some function of the coordinates and 
may be a different function in different coordinate systems. Temperature is 
a classic example of a scalar function under rotations. 

We can define a 3-vector to be a triplet of components that transforms 
under rotation of the coordinate system according to the specific pattern: 


! 
a; = Fa Gy; 


This definition includes such familiar vectors as displacement, velocity and 
acceleration. 

We can define a 3-tensor to be a nonet of components that transforms 
under rotation of the coordinate system according to the specific pattern: 


Some physical quantities, such as the dielectric tensor, are tensor quantities. 
Tensors of higher order with more than two subscripts can be defined in an 
analogous fashion. In this way, scalars, vectors and tensors are defined by 
the patterns or their transformation under rotations. 
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A scalar which has the same functional form in different coordinate sys- 
tems is said to be an invariant. The scalar dot product of two vectors is an 
example of an invariant, 


a-b= abi = Ria; Rindg = Rij Rina; dp = 05~05 DK = Ay bg. 


Thus, the dot product has the same form and value in either coordinate 
system (a/b! or a;b;). It is an invariant. Since the dot product of a vector 
with itself is the square of its magnitude, the invariance property says that 
the length of a vector does not change when one simply rotates the coordinate 
system. 


10.5 Lorentz Transformations 


The quartet (dx, dy, dz, cdt) are said to be components of a four-vector. Any 
quartet of variables that transforms under a special Lorentz transformation 
in the same way as this quartet transforms is likewise a four-vector under 
Lorentz transformations. We may write the transformation as, 


dx’ = y(dx — vdt) 


dy’ = 
dz’ =0 
vdx 


However, to develop the formalism, it helps to make some notational changes. 
Let us define, 


dx dx! 
dy | _ | dx? 
dz | | dx 
cdt dx* 


We can adapt our summation convention on repeated indices to the Lorentz 
transformation if Greek letters are used when the range of the indices is four 
rather than three. We may then write our special Lorentz transformation as, 


dx’! y 00 -8y dx! 
de? |. 0 10 =O dx? 
dx | 0 01 0 dx? 
dx" =f 00. dx* 


We might also write this as, 
dS dae: 


This is the pattern that defines a 4-vector under Lorentz transformations. 
The inverse Lorentz transformation can be read from 


dx = y(dx' + vdt') 


dy = 0 
a2 = 
d / 
dt = y(at' + =), 
namely, 
A: Aue sy 
gh. 0 10 0 
woe 0 01 0 
ay Os OL ay 


It follows that, 
LL =LILeE=T. 


All of this is done in terms of the special Lorentz transformation. How- 
ever, we can include our discussion of 3-vectors into the 4-vector formalism by 
simply expanding R by one dimension, i.e., the rotation transformation be- 
comes a special case of the Lorentz transformation. For example, the matrix 
for the special rotation transformation is generalized to, 


cos? sind 0 0 
—sin9 cos@ 0 0 
0 0 10 

0 Oe) Ota 


R- 


We can generate more general Lorentz transformations by building them 
from an ordered sequence of special Lorentz transformations (including spe- 
cial rotations). For example, consider a primed frame whose origin does not 
move along the z-axis, but rather moves along a line in the x — y plane. 
The line is inclined by an angle @ to the z-axis. Thus, unlike the special 
Lorentz transformation, the velocity of the primed system has two compo- 
nents, v; = vcos@ and v, = vsin@. However, assume that the x’-axis is still 
parallel to the x-axis and that the y’-axis is still parallel to the y-axis. 
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Begin with the primed and unprimed systems both at rest with respective 
axes parallel. First, rotate the primed system by an angle @. Then give the 
primed system a boost to give it a speed v relative to the unprimed system. 
Finally, rotate the x’-axis back through an angle —é to get it parallel to the 
x-axis again. The overall Lorentz transformation is a combination of these 
three, 


L= R_6L,Re, 
or, L = 
cos@ —sin#@ 0 0 ap Ae ay, cos@ sind 0 0 
sin@ cos? O O 0 10 O —sin@ cos? 0 0 
0 0 1 0 0 01 0 0 0 1 0 
0 0 0 1 Beye Oy 0 0 Ol 


v2 Ug U Vv. 
Lay ae 10. ey 


v2 Vv. 

=|) (Gabe Asha be Ow Sa7 
0 0 1 0 
re =e. Oe. y 


In a similar fashion or simply by extending the patterns that are evi- 
dent from our example, one can generate the general Lorentz transformation 
without rotation, 


Lepiya eye, Jada aby 


: v2 VyV 2) 
peal “Geb tte ee. 7% 
QeDe (SDS syle =e 

= tery — aby — tay y 


This may also be written in 3-vector notation as, 


dr-v 
ee 


dr’ = dr + v[(y — 1) — ydt] 


dr-v 


dt’ = y|dt — 


iB 


Exercise: Use the general Lorentz transformation without rotation to 
derive the expression for dr’ in 3-vector notation and the expression for dt’. 

Exercise: How do ordinary velocities transform under the general 
Lorentz transformation without rotation? 
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The subsuming of rotation transformations into the larger class of Lorentz 
transformations can be made more natural in the following way. Define 
tanh@ = v/c. The function @ is called the rapidity. Since, —1 < tanhé < 1, 
this definition is consistent with all possible frame speeds less than the speed 
of light. Using, 

cosh? 6 — sinh? @ = 1 


sinh 0 
tanh @ = 
7 osh 0’ 
it follows that, 
1 
cosh 6 = ==7 
1-3 
sinh? = no 
C 


We may then write the special Lorentz transformation as, 
dx’ = dx cosh@ — cdt sinh 6 


cdt' = —dz sinh 6 + cdt cosh 6, 


dz \ _ ( cosh@ —sinhé dx! 

dx }~ \ —sinh@  coshé an p 
For special Lorentz transformations, the transformation matrix has the ap- 
pearance of a rotation transformation matrix with hyperbolic functions re- 
placing their trigonometric counterparts. 

The rapidities have an important characteristic that makes them very 
useful. Consider a primed frame, a double primed frame and an unprimed 
frame that are originally identical. Then, boost the primed frame and double 
primed frame together to a velocity uv’ with respect to the first. Finally, boost 
the double primed frame to a velocity uv” with respect to the primed frame. 
The overall Lorentz transformation that relates the double primed frame to 
the unprimed frame is obtained, 


or, 


L=L'l = cosh6” —sinh@” cosh@’ —sinh6@’ 
—sinhé” coshé” —sinh0’ coshé’ 


_ [( coshé" cosh + sinh 6” sinh 6’ =— cosh 6” sinh 6’ — cosh’ sinh 6” 
~ \ —coshé@” sinh 6’ — cosh @’sinh@” cosh @” cosh 6’ + sinh 6” sinh 6’ 
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Thus, 


fies coshO —sinh@ \ / cosh(&’+6”) —sinh(6’ + 6”) 
~ \ =sinh@ cosh@ }  \ —sinh(@’+6”)  cosh(@’ +6”) 


Thus, rapidities have a much simpler addition property than relativistic ve- 
locities. Rapidities add like Galilean velocities! 


10.6 A Brief Tensorial Tutorial 


We noted in passing that the special Lorentz transformation, 


dx’ = y(dx — vdt) 


dy’ = dy 
dz' = dz 
vdx 


dt! = (dt — 2? 


can be compared to the chain rule result, 


Ox’, Ox. 
dx’ = —dzx; + —dt 
: Ox; : Ot 
ot! ot! 
dt’ = —dx; + —dt. 
Oe ¢ Ot 
By comparison of the two sets of expressions and defining dx, = dz', drz = 
dx?,dx3 = dx,cdt = dx* we may write the Lorentz transformation as, 


dat” = Lt dx’. There is an implied sum on the repeated index, v. A four- 
vector is defined to be a quartet that transforms in this way. In fact, this 
pattern must be distinguished from a closely related pattern, so we will call 
it the contravariant transformation pattern. We will distinguish quartets 
that transform according to this pattern by using superscripts and we will 
refer to the quartet as the contravariant components of the four-vector. The 
positioning of the primes tells us that the elements of the matrix L are given 
by, 

_ Oxl" 


i —— os 
Oxr¥ 


Now, we could have done the transformation the other way, 


dx = (dz’ + vdt') 


dy = dy 
dz = dz' 
vda' 


and, the chain rule, 


Ox; P ; P 
dx; Bat dx’; AH dt 
Cte pu: OL 
dt = 5 dx; + art 


If we compare these expressions, we obtain the elements of the inverse Lorentz 
transformation, L~! 


Ox" 

Ox!” : 

We shall usually let the position of the primes alone indicate that this is the 
inverse matrix and write, LL~' = J and L-'L =T as 


Ca 


V 


DYES = oN ’ 

Le 58. 
The symbol, 6\, is the Kronecker delta and it represents the elements of 
the identity matrix. Note again, that a repeated index indicates a sum. In 
virtually all cases in expressions that follow, the repeated index will appear 
once as a subscript and once as a superscript as it does here. This is part of 
the pattern that gives elegance to the formalism. 

The chain rule yields another very important result. Think of a function, 

f = f(a;,t). Then, 


af _ af dn, , af a fon, a, a a), 
Oxi Ox; Ox, Ot Ox’, 


Ox) Ox; — Ox’, Ot 


We may think of this relationship defining a transformation property for the 
derivative operation, namely, 


Observe that this is not the contravariant pattern (the primes on L are in 
the wrong place.) Instead, we will write this expression as, 


Op! = Ln, Op. 


This transformation pattern is called the covariant transformation pattern. 
Quartets which transform in this way are said to be the covariant components 
of a four-vector and they are labelled with subscripts rather than superscripts. 

Similarly, 4-tensors have contravariant and covariant components as well. 
They transform according to the patterns: 


AP = Te Le Aw 
poy 
ee oe 
asp — Dy Lg Apy- 
Exercise: Demonstrate explicitly that 
PU = Ae 


is a form-invariant equation under Lorentz transformations. 


10.7 Relativistic Four-Vectors 


Four-vectors play a very important role in relativity for the following reasons: 


1. All four-vectors share the same Lorentz transformation pattern from 
one frame to another. If you know a thing is a four-vector, you imme- 
diately know how it transforms from frame to frame. 


2. The generalized scalar dot product of two four-vectors is an invariant. 
If you know the magnitude in one frame, you know it immediately in all 
frames. This property is useful if the invariant is very simply evaluated 
in one frame, but much more complicated in another. 


3. If we wish our “laws of physics” to have the same form (covariance) 
in all inertial frames, we must look for laws that express relationships 
among four-vectors. The Lorentz transformation will transform such 
relationships so that they have the same form in all admissable frames. 
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At this point in our treatment, however, we have encountered only one 
four-vector, (dx', dx*, dx*, cdt). We shall see below how to create other useful 
four-vectors beginning with this one. 

If we define ds to be a separation between events, we can generalize a 
result from differential geometry, namely, 


ds* = gijdq'dq — ds? = Jogdx*dx®. 


This pattern to create ds-ds serves as a generalization of a scalar dot product. 
Scalar dot products are scalar invariants under rotations. In four-space, the 
thing which is invariant under the special Lorentz transformation is, 


ds* = dx? + dy* + dz? — edt? = dx? + dy” + dz? — c?dt”. 


You can verify that this is true by direct substitution of the special Lorentz 
transformation. 

Exercise: Demonstrate the invariance of ds? under the special Lorentz 
transformation. 

Therefore, to make the formalism work, we must define our metric tensor 
(in Cartesian coordinates), 


1G 108 02 “0 
Os 0. 
Jae = begs 4a: 3h; 50 

Ors Oe 


Then we can write, 
ds* = gogdx*dx® 


in analogy to the form taken from differential geometry. Indeed, we can 
use curvilinear coordinates rather than Cartesian coordinates. For spherical 
coordinates, the metric tensor would be, 


1 0 0 0 
[or 0 0 
JoB ~ 1 9 Q p2sin2@ 0 
0 0 0 -1 


However, one familiar characteristic of ds? that we give up by this general- 
ization is that, unlike the case in three-dimensional differential geometry, we 


148 


may (and usually do) have ds* < 0! The function ds? is an invariant. It has 
the same form, ds? = g,gdx°dx*, and same value in all admissable frames. 
Observe that the inner product of two four-vectors can be written, 


GipA Be = A" By, =— A 


The practical consequence of the metric tensor for Cartesian four-vectors is 
to associate a negative sign with the fourth-component term in the sum, 


Ow Bb: = A'B! + A’ B? + ASB? = AB = A,B, + AoBo + A3Bz = A,By. 
In the rest frame of a moving clock, 
ds = (0,0, 0, cdt’) = (0,0, 0, cdr). 


Thus, ds? = —c?dr? is the invariant “squared length” of ds. The squared 
proper time, dr”, being equal to the invariant ds? divided by a constant of 
nature, c, is also an invariant. If we attach a frame of reference to a moving 
particle, its velocity in the unprimed system and the velocity of the primed 
frame are the same, u = v. In the laboratory frame, ds = (dz, 0,0, cdt), so 
ae 1d ZOb- 
2 Gro GG F959 V9 € 

ds* = dx* — cdt* = —cdt*(1 Na or 
By the invariance property, the value in the laboratory frame is the value in 
the rest frame, so, 

edt? 

ae 
_ dt 

a 
We have thus recovered the result called “time dilation.” 

A four-vector divided by a scalar invariant is a four-vector because it has 
the transformation pattern of a four-vector. We can create a new four-vector 
by dividing ds by dr. The resulting four-vector is called the four-velocity and 
has components, 


—dr? = 
or, 


at 


dat 
r=, 
dt 
We then have, 
dz! = dx dx 


ea ae 
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dx*  cdt : 
dt ~ adt/y 


We might also write the four-velocity as, 


UY = (Yuz, YUy, YUz, YC) = (yu, yc) = (acsinh 6, ccosh 9), 


where n is a unit vector tangent to the particle motion in three-space. 

Because U is a four-vector, we immediately know that it transforms from 
frame to frame according to the pattern, U“’ = L!’U”. We also know that its 
“squared length,” g,,U"U”, is an invariant. Its value is most easily obtained 
in the rest system of the particle where u = 0 and y = 1. Then, U = (0,0, 0, c) 
and g,,U"U" = —c?. 

Like proper time, the proper length and the proper mass are invariants. 
We may multiply U by the proper mass, mo, to create yet another four- 
vector, the four-momentum,P" = moU" = (moyu, moyc). We immediately 
know that this four-vector transforms according to the usual pattern and we 
know that it has an invariant squared length, 


Gig PY Sane Gig") = mae 


It is customary to combine the factor of y and the proper mass to create a 
relativistic mass, m = ympo and a relativistic momentum, p = ymou = mu. 
We may also define a relativistic energy, E = ymoc? = mc?. Then, the four- 
momentum in an arbitrary frame can be written, P“ = (p, E’/c). Using the 
invariance property of the squared length, we obtain the useful relationship, 


2 
2 E = 22 
Dp Fg Od 


or, 
Ee =p?? + mac. 

Exercise: Derive this latter result, taking care to show that the algebraic 
signs are handled correctly. 

We may create a four-acceleration by differentiating the four-velocity with 
respect to proper time, A” = U“. To maintain form invariance (covariance) 
under Lorentz transformations, the laws of physics must be expressed as 
relationships among four-vectors and, possibly, four-tensors. With m )A“, we 
have a four-vector with which we can build a covariant form of Newton’s 
Second Law. But, first, we need to develop four-vector forces. 
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We can obtain another important four-vector from a different line of rea- 
soning. Consider a plane electromagnetic wave propagating in space. We 
may represent the fields of the electromagnetic wave as, 


A = Aocos(k- x — wt). 


A represents either the electric field, E, or the magnetic field, B, associated 
with the electromagnetic wave. The vector k is the wave number defined by 


where n is a unit vector in the direction of propagation and is the wave- 
length of the wave. The magnitude of k is the number of waves in 27 units 
of length. Since it depends on a measured length, the Lorentz contraction 
implies that k is not a relativistic invariant. Similarly, the frequency, w, of 
the wave is a certain number of radians in a measured period of time and 
because of time dilation is also not a relativistic invariant. 

The quantity k-x—wt is called the phase of the wave. It is a dimensionless 
quantity and is unaffected by either Lorentz contraction or time dilation. The 
phase is a function of a position and a time. We might think of an observer 
making an observation of the wave at (x,t) thus defining an event in space 
time. The primed coordinates of this same event are (x’, t’). The phase of the 
wave determines whether the observers are observing the peak, the trough, 
or some other part of the cycle of the wave. But, since they will both agree 
that they are observing the same part of the wave cycle, the phase must be 
the same for both. The phase is a relativistic invariant, which means that, 


k’. x’ — w/t! =k-x—-ut. 


For simplicity, consider a special Lorentz transformation and an electro- 
magnetic wave propagating along the common z, z’-axes. Then, 


ki da’ — w'dt' = ki |y(dx — vdt)| — w'|y(dt — —)] 


\dx — y(w' + kiv)dt 


vu! 
a angen 
C 


— k,dx — wdt. 
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If the phase is an invariant, we must have, 


/ 
/ / 
Say + za 
ec c c 
But, this is just the pattern for the special Lorentz transformation of a four- 
vector, k# = (k,w/c). We will call this four-vector the four-wavenumber. 
Two important results follow immediately from the Lorentz transforma- 
tion of this four-vector. Imagine electromagnetic waves that are observed 
approaching an observer at an angle, a, in the x, y-plane. Electromagnetic 
waves have an especially simple dispersion relation, that relates the wave 
number and the frequency, namely k = w/c. Because the four-wavenumber 
is a four-vector, we know immediately how it transforms, 


k’ cosa’ 10K Oa k cosa 
kisina’ | | 0 10 O ksina 
0 a 0 O11 0 0 
w/c -*y 00 ¥ w/c 


Cc 


In particular, 
vyw 
k' cosa’ = yk cosa — a 
ae 


v Ww 
— = —-ykcosa+7-. 
C C C 
Using the dispersion relationship for electromagnetic waves, the second of 
these can be written, 


w’ = yw(1— = cos). 
c 


This relationship is the relativistic Doppler shift. It tells what the frequency 
of electromagnetic waves is in a primed frame if those same waves are ob- 
served with frequency, w, in the laboratory and arriving at angle, a. 

The second important relationship comes from dividing the first of the 
transformation equations by the second and using the fact that k’ = w’/c 
and k = w/c. We have then, 

F cosa — v/c 

cosa’ = ————_—_.. 
1 — (w/c) cosa 

This is the relativistic aberation formula and gives the angle of approach, a’, 
in the primed system in terms of the angle in the unprimed system. 
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10.8 The Four- Vector Formulation of 
Maxwell’s Equations 


To develop a Lorentz form-invariant replacement for Newton’s Second Law, 
we need a four-vector expression for force. Of the four fundamental forces, the 
strong and weak forces are quantum mechanical and do not fall in the domain 
of phenomena described by Newton’s Second Law, although the quantum 
field theories that treat them are usually expressed in relativistically covariant 
form. Gravity, on the other hand, is treated in an extension of the Special 
Theory of Relativity called General Relativity. The laws of General Relativity 
are expressed in covariant form, but the treatment goes beyond what we can 
do here. 

We are left with the electromagnetic force. It is the force exerted on 
charged particles by electric and magnetic fields. In nonrelativistic mechan- 
ics, it is expressed as a 3-vector, called the Lorentz Force, 


1 
F = q(E+ -u x B). 
c 


Since the force depends on a combination of electric and magnetic fields, as 
well as the charge, g, and velocity, u, of a particle, it is a somewhat more 
difficult task than we have yet encountered to express the electromagnetic 
force in four-vector form. 

Maxwell’s equations express the relationship between electromagnetic 
fields and the sources (charge and current) that generate them. For our 
purposes, it is not necessary to understand many of the applications of 
Maxwell’s equations to electromagentic phenomena. What we shall show 
is that Maxwell’s equations are already fully consistent with Special Relativ- 
ity. We do this by showing how Maxwell’s equations can be written down in 
terms of four-vectors and four-tensors, thus ensuring that the equations have 
exactly the same form in all coordinate systems of Special Relativity. We 
shall also show how the Lorentz Force is expressed as part of a four-vector. 
With this force as an example, we will then show how Newton’s Second Law 
and Newtonian dynamics are incorporated into Special Relativity. Finally, 
we will conclude with some examples. 

The sources of the electromagnetic field are stationary or moving charges. 
Here we must add another postulate to our theory of Special Relativity: 


4. Postulate of Charge Invariance: The electric charge on an elemen- 
tary particle is an absolute invariant. 
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The charge density, on the other hand, is the amount of charge per unit 
volume, 
Nq 

p= Adz 
where, N is the total number of charged particles, g is the charge on each 
particle, A = dydz is an area that is invariant under the special Lorentz 
transformation, and dz is a differential of distance along the common z, ’- 
axes. For a special Lorentz transformation, dy = dy’ and dz = dz’ so that 
A= A’. However, dx = y(dz' + vdt') so that charge density is not relativis- 
tically invariant. Neverthless, like proper length, proper time and proper 
mass, the proper density, 9, is a relativistic scalar and can be used to create 
a new four-vector (called the four-current), 


J* = poU" = (poyu, poyc): 


As we have done to create relativistic mass, it is customary to define the 
relativistic charge density as p = ypo so that J“ = (pu, pc) = (j, pc). Thus, 
we have defined the relativistic current, j, with units of charge per unit area 
per unit time. In this way we have created a four-vector that describes the 
sources of the electromagnetic field. 

The electromagnetic fields, E and B, do not individually extend naturally 
to become parts of four-vectors. Indeed, it is one of the surprising insights 
of Special Relativity that the six components of these 3-vectors are elements 
of a single electromagnetic field tensor. In a sense, electric and magnetic 
fields are not different things, but rather equivalent parts of the same thing. 
What one observer reckons to be a pure electric field, is reckoned by another 
observer in a different frame to be a combination of electric and magnetic 
fields! 

This time we are going to do it backwards. We are not going to start with 
electromagnetic experiments and develop equations to describe them, which, 
after much work are to be shown to be relativistically consistent. Rather, 
we are going to do a very remarkable thing. We are going to assume as a 
basic premise that the laws of physics must be expressed in relativistically 
form-invariant equations and let that assumption tell us essentially what the 
laws of electrodynamics must be. We will have to make a few “judicious 
choices” along the way so that the laws of physics come out in their familiar 
form, but these are more convention than anything else. 

We begin by assuming that there exists a four-vector, A” = (A, ¢/c). The 
factor 1/c is not essential, but including it here makes it easier to connect our 
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results with the familiar equations of electrodynamics. The four-vector A is 
generated in some way by the sources that we have described by the four- 
current, J. The four-vector A has only four components, so it alone cannot 
incorporate the six components of E and B. Therefore, we must use one of 
the methods we have identified for generating four-tensors from four-vectors. 

We can generate a four-tensor by differentiating a four-vector. For ex- 
ample, T),, = 0,A,, but the result is a tensor that has sixteen components. 
We only need six slots for our electromagnetic fields. However, we could add 
two tensors and create another tensor. Consider, Tj, = 0,A, + 0,A,. This 
tensor is symmetric because T,,, = T,,. Symmetric tensors have ten inde- 
pendent components which you can see if you write down a 4 x 4 symmetric 
matrix of the elements. But then we have ten slots for our six components 
of electromagnetic field. Finally, consider, JT), = 0,,A, — 0,A,. This tensor 
is anti-symmetric because T,,, = —T,,. This means that the four diagonal 
elements of its matrix must be zero. The anti-symmetric tensor has six inde- 
pendent elements and that is just what we need! So, we define the covariant 
components of an antisymmetric electromagnetic field tensor, 


Arranging the electromagnetic fields in just this way is one of the “judicious 
choices” mentioned above. We have done it so that the four-vector equation, 
Tw = OA, — O,A,, is equivalent to the two vector equations, 


10A 

B=—-— errr 
c Ot 
B=Vx<xA. 


These relationships are familiar in electrodynamics. Ordinarily, they are 
used to define the vector potential, A, and scalar potential, ¢, by defining 
their relationships to the more familiar electrical and magnetic fields. In our 
approach here, we introduced the four-vector A” = (A,@/c) first. By ar- 
ranging the components of the electric and magnetic fields in the field tensor 
as we did, we identify that our assumed A is the vector potential of con- 
ventional electrodynamics and that @ is the scalar potential of conventional 
electrodynamics. 
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We can also “raise the indices” to create the contravariant components 
of the electromagnetic field tensor, T#” = g"“g”°Tyg. Then, 


0 Bz; —By —k 
—Bz; 0 B, —£» 
By —B, 0 — 3 
B Be “Be * 


Pi 


Exercise: Derive the expression for 7” by raising the indices of Ty. 
With a little patience you can show that the following two form-invariant 
equations, 


4 
a,7°" = pou” 
( 
Oy LX ots On te OT, =0 


are completely equivalent to Maxwell’s Equations in Gaussian units, 


V-E=47p 
10E 47 
V x B- -— = — 
‘ c Ot a 
V-B= 
10B 
E+ —-— =0 
ae Fe oF 


The first two of Maxwell’s equations have sources on the right as does the 
first of the two relativistic equations. The second two of Maxwell’s equations 
and the second of the relativistic equations are equal to zero on the right. 

Exercise: Demonstrate how V -B = 0 follows from the four-vector form 
of Maxwell’s equations. 

It follows that the Lorentz force, being a function of fields and velocity, 
can be written as some combination of the field tensor and the four-velocity. 
It can be readily verified that the expression, 


1 
Ke = “TU, = (OF, iF -u), 


is the four-vector that contains the Lorentz force, F. The four-force, AK“, is 
sometimes called the Minkowski force. Therefore, the relativistic version of 
Newton’s Second Law in those instances where the force is electromagnetic 


must be, 
dU" 
lreyy, = mo —_. 
c 


at 
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Exercise: Verify the relationship between the Minkowski force and the 
Lorentz force by expanding T*#”U,. 


10.9 Relativistic Particle Mechanics 


Special Relativity deals with frames of reference that move relative to one 
another at constant velocity. Occasionally, the newcomer to the subject 
will have the mistaken idea that Special Relativity only deals with constant 
velocities, while General Relativity is the extension of Special Relativity to 
deal with accelerated motions. This is a misconception. General Relativity is 
an extension of Special Relativity to deal with the force of gravity. The only 
restriction of Special Relativity is that motion, including accelerated motion, 
be described from a frame which is at rest or in uniform motion. Many of the 
applications of Special Relativity are to the acceleration of charged particles 
in electromagnetic fields, either in accelerators or elsewhere in the cosmos. 

Consider a charged particle moving with velocity u in the laboratory 
frame. We may write the four-vector law of motion as, 

dU" 


1 
K# = (oF, -yF - u) = m—_. 
(y oY u) =m a 


Since dr = dt/y, we may write this as, 


(OF, “4F u) = mo (Foow, 50) ; 


1 d d dp 1dB 
(F, -F-u) = (Gomow), (mee) = (eto) 


where, p is the relativistic momentum and F is the relativistic energy of the 
particle. We may separate the single four-equation into two, 


Thus, 


The first is a relativistic statement of Newton’s Second Law, except that p 
is the relativistic momentum rather than the Newtonian momentum. The 
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second of these two equations is a statement that the power, F-u, delivered to 
the particle by the force, F, equals the rate of increase of the total relativistic 
energy of the particle. The total relativistic energy, E = ymoc?, is the sum 
of the kinetic energy of the particle plus its rest mass energy, moc’. 
There is a peculiar consequence of the relativistic Second Law. 
Exercise: Show that 


dy 7° du 
= —su—. 
di. eds 
Exercise: Show that ; 
au 3 
a eet A 
c 
We have then that, 
d du dy > du aie 
— = y— —=yYya suUu— =y7(a, + ay) + usu—, 
Gg te ee i a) 


where a, is the component of acceleration perpendicular to the direction of 
motion and aj is the component of acceleration parallel to the direction of 
motion. 

Now, u- u = uw’. If we differentiate, 


U 
u-a=U—Z = Ua|. 


dt 


Because u and aj are in the same direction, we may use that fact that 
ud), = ua, to write, 


d pu? 
Ho) = yal + (y + oa Jay = yal + Pay. 
Thus, 
F = dp _ Cas 
= Tt =moyat + moy ay = Ma, + May. 


The particle behaves as if it has two different masses, one for parallel acceler- 
ation and one for perpendicular acceleration! It is more difficult to accelerate 
a particle by an electromagnetic force along its direction of motion than it is 
to accelerate it perpendicular to its motion. 
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10.10 The Twin Paradox 


Imagine twins, Albert and Henri, who part one day for separate, extended 
voyages in spacetime. Let Albert stay behind and remain at the origin of his 
stationary, unprimed reference frame, but let Henri move away in a rocket 
ship which defines the origin of his primed frame. After some time, the 
two meet again and compare their ages (clocks). Let us assume that they 
meet where they first start with the origins of the two systems once again 
coincident. The parting and the reuniting define two events in space time for 
which dx = dz’ = 0. Thus, the invariant ds? = —c?dr?. But which clock has 
actually measured the proper time? From Albert’s point of view, his clock 
has remained stationary and has measured the proper time, while Henri’s 
clock has moved and measured a dilated time. But from Henri’s point of 
view, it is Henri’s clock that has remained stationary, while Albert and his 
clock have moved and have measured a dilated time. Do their clocks agree 
or not? How can they agree if each thinks the other has measured a dilated 
time relative to his own? How can they disagree if there is symmetry in their 
points of view? We have here an apparent paradox, often called the twin 
paradox. 

The paradox is resolved in the following way. First, the viewpoints of the 
twins are not necessarily symmetric. If the two are to be reunited, at least 
one of the pair must turn around, i.e., be accelerated during at least part 
of the journey. Special Relativity is a description of motion from a frame of 
reference that is at rest or in uniform motion. Thus, the unprimed frame of 
Albert is an admissable frame for using the relationships of Special Relativity, 
but Henri’s accelerated frame is not during the period of acceleration. Thus 
the symmetry is broken. It is true that ds? = gygdx°dx® = ggaree” , but 
the metric tensor in the primed frame is no longer a function of constant 
values of y and v. Indeed, without some thought, it is not obvious what it 
should actually be. 

Nevertheless, we can calculate a comparison of Albert’s elapsed time rela- 
tive to the proper time measured on Henri’s clock if we use Special Relativity 
in Albert’s frame. Let us assume that Henri’s rocket ship is accelerated by 
some kind of constant force, F = mog. Newton’s relativistic Second Law is 
then, 


d 
F = mg = ql mou). 


If we assume that the acceleration is one-dimensional along the common 
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x, x'-axes, we have 
= af u) 
g a dt v * 


In this instance we can make good use of the rapidity variable by using, 


sinh 6 = a 
c 
cosh 6 = 7 
tanh = ~. 
C 
Then, 
d ld c od dé 


2 ae) ae ota tanh?) “dr 


Assuming a boundary condition of @ = 0 at 7 = 0, we can integrate to obtain, 


pats, 
C 
The four-velocity yields, 
Lb 
Ue = = = ae = (csinh 6, 0,0,ccosh@), 
dt dt dt 
Leé., 
ba = csinh(Z) 
Be cosh(*), 
dt c 


Each can be integrated using appropriate boundary conditions to yield, 


2 
= = (cosh() — 1) 
g c 


Observe that the function for tg is an even function of g, so that it serves as 
well for acceleration and deceleration. 
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If Henri accelerates to a terminal coasting speed, u., we have during the 
coasting period, 


d 
qe) = 0 


from which yu = constant = 7,u,. Then, 


cle 
a ee 


cosh 6, = Ye. 
Again, 


ae dt 
M = —— —— — j 
U (F.0.0.%) (csinh 6, 0,0,ccosh @,), 


from which, 


Le SCF sinh(2““<) 
c 


i= 7 cosh(2“2), 

By combining periods of acceleration, coasting, and deceleration, one can 
demonstrate that when Henri returns to meet with Albert, his (proper) time 
and age will be less than that of Albert who stayed behind. 

Exercise: If g is chosen for comfort to be the local acceleration of gravity 
(which in truly unusual units is 1.03 light years/year/year), how far would 
Henri travel in 22 years of his own time? How much time will elapse on earth 
if Henri makes a round trip by accelerating for 6 years (proper time), then 
decelerating for six years and then reversing the process to come back? How 
much time will he add to the elapsed time on earth if he coasts for a year 
(proper time) on the way out and again for a year on the way back? 


10.11 Conservation Laws 


The force law, 


tells us that for a particle, 
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dt’ 

In the absence of force, the four-velocity is a constant of the motion, i.e., p 
and F are constants of the motion. If u is modest in magnitude relative to 
c, we can expand j, 


1 ee rae 
1—w/2 © 202° BAT 
Hence, 
1 3 5 
B = moc” + smou” + == oe 


The first term exists even when u = 0. We call moc? the rest energy of the 
particle. The remaining terms, beginning with 1/2mou?, are associated with 
motion and together form the relativistic kinetic energy. Because relativistic 
kinetic energy is not directly identifiable with a component of a four-vector, 
it usually plays a less important role in relativistic mechanics than the rela- 
tivistic momentum and the total relativistic energy. 

It is possible within Special Relativity to have a particle for which mp = 0 
if that particle moves at the speed of light. In such a case, both the numerator 
and the denominator of ympo become zero. However, if we let this happen in 
such a way that the total energy of the particle is finite, we have, 


ym? > E 


ym > E/e? 
p = ymoun — (E/c?)(c)n = E/ch. 
The four momentum then becomes, P4 = (E'/cn, E'/c). The photon of quan- 


tum mechanics is such a massless particle for which the total energy is hw. 
The four-momentum of a photon is 


h h 
pla A), 
c Cc 
For a system of n particles, we define the total relativistic momentum 
of the system to be the sum of the relativistic momenta of the individual 
particles of the system, 


P= Pi 
i=1 
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and the total relativistic energy, E;, to be the sum of the relativistic energies 
of the individual particles. If Newton’s Third Law holds relativistically, this 
means that P and E; are conserved in the absence of external forces. 

The center of mass system is defined to be that (primed) system for which 
P’ = 0. The total relativistic three-momentum P and F;/c are the elements 
of a four-vector and transform according to the pattern of four-vectors. This 
means that, 

EF, = 7(E;+v-P’). 


If we choose the primed system to be the center-of-mass system, P’ = 0, we 
have E; = yE}. If we now think of our system of particles as a body made 
up of the constituent particles, we can give it a proper mass, Mo, 


Dp, = y¥Moc? = VE 
from which, 


= nr Mo; n 7! ‘. Vv 
m= 3 (aes ) - 2 moe toot aes 


Here, T/ is the relativistic kinetic energy of the particles in the center-of- 
mass frame and V, represents the potential energies that the particles might 
have as a result of their interaction. Clearly, the proper mass of the body 
is more than the sum of the rest masses of the constituent particles. If the 
kinetic energy of the particles is increased by heating, for example, the mass 
Mo of the body increases! Similarly, if the potential energy of the particles 
decreases as a result of their interaction, the mass of the body might even 
be less than the sum of the rest masses by an amount Am called the mass 
defect. In any case, it is clear that mass and energy are not different things, 
but manifestations of the same thing, mass-energy. 


10.12 Uses of Invariants 


Conservation laws tell us that certain quantities are the same at different 
times. Invariant quantities tell us that certain quantities are the same in 
different frames. Sometimes the two can work together advantageously. 
Imagine a proton accelerator that collides protons on stationary targets. 
Imagine that the intent of the collision is to collide a proton on a proton 
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target to produce a proton-antiproton pair from the kinetic energy of the 
accelerated proton. We want to know how much energy the initial proton 
must have if an proton and antiproton exist after collison in addition to the 
original two protons. In other words, we want to know the threshold energy 
for the process. 

In this case we are not concerned with the dynamics of the interaction 
itself. We are just interested in a relationship between quantities before the 
collision (beam energy) and conditions after the collision (four masses). This 
is something about which conservation laws tell us. Experiments of this type 
are carried out in the laboratory frame, but they are usually most easily 
analyzed in the center-of-mass frame. Invariants tell us something about 
relationships between frames. 

Both in the laboratory frame and in the center-of-mass frame, we have a 
“before” four-momentum and an “after” four-momentum, i.e., four different 
four-momentum vectors. The “before” four-momentum in the laboratory 
frame consists of the 3-momentum of the single moving proton as well as the 
total relativistic energies of the two protons, 


Ey + Mpc? 
Pyilab = (v1.0.0. a 5 


Cc 


The “after” four-momentum (for threshold) in the center-of-mass frame con- 
sists of four stationary particles clustered together. The four-momentum 


is, 
2 
pH = (0,0,0, \. 
C 


a,cm 


Since there are no external forces on the system, four-momentum is con- 
‘ Le — Pp 
served: Ps om = Pin. Thus we have, 


LL v _ LL v = LL V 
Gv Ps tab! lab = Dil y eond bor = Urn irre ee 


The first equality follows from invariance and the second follows from the 
conservation law. Skipping the intermediate step, 
2 (Expy + myc’)? = (dm,c*)? 


1b 2 ay 


ec ce 


However, we may also use the fact that the four-momentum of the original 
beam particle alone forms an invariant connecting the laboratory and the 


164 


rest frame of that single particle, 


2 2.4 
v Ex, Mae 
1b 
Cc? c2 


Combining these two relationships, we have 
Ey, = Tie 


If we add the rest energy of the stationary target, we have 8m,c? in the 
laboratory frame, of which 4m,c? is available in the center-of-mass frame to 
produce particles. 

How much energy would we have in the center-of-mass frame if we accel- 
erated both protons to an energy of 7m,c? and figured out a way to collide 
them head-on. In this case, the laboratory frame becomes the center-of-mass 
frame and all 14m,c? is available to produce particles, i.e. 3.5 times as much 
as before. In terms of four-vectors, 


2E 
Pe (0.0.0, *) 


Plt = (0,0,0,=2) 
é 

from which Eqn = 14m,c?. For this reason, many of the modern accelerators 

are designed as colliders rather than stationary target accelerators, although 

there are other tradeoffs, including the rate of collision events, that also must 

be considered. 

As another example, consider the decay of a neutron. A free neutron 
decays into a proton, an electron and an antineutrino with a proper mean 
lifetime of 14.8 minutes. Suppose we want to know the maximum possi- 
ble energy of the ejected electron. We shall suppose that in this case the 
antineutrino carries away negligible energy and momentum. In this special 
circumstance, it is as if the antineutrino doesn’t exist. 

Let the neutron be at rest, so that the laboratory frame and the center- 
of-mass frame are the same. In the absence of external forces, the four- 
momentum of the system is conserved, 


(0, 0, 0, HC) 6om = (pe + Pp, (E. ae PC aoe = (De —Pbo; 0, 0, (E. te Pon) Cheeta 


and has an invariant scalar product. Thus, we have, 
Pe = Pp 


165 


The four-momenta of the electron and proton taken separately have in- 
variant scalar products, 


Ee 

pi — (2)? = —m2e 
Cc 
E 

ph — (AP)? = —m3e 


Combining these conservation and invariance relationships, 


(m? 2 


2 2 
e _ (m,-— m+ mz)c 
eymax — 


2Mn, 


Since the neutron was assumed to be at rest, we can take this result to 
be a center-of-mass result. If the neutron were moving along the x-axis of 
the laboratory with velocity v, we could transform the result back to the 
laboratory frame to obtain a result for the maximum energy of an electron 
resulting from the decay of a moving neutron. 

Let us imagine now that the electron is emitted at an angle @ relative to 
the x-axis in the laboratory system. We want to know what FE. maz is in the 
laboratory as a function of 6. The value for Ee mar already obtained in the 
center-of-mass frame now becomes E’ Since energy is an element of a 


e,max* 


four-vector, we know how it transforms, 


E. Ee ov 
—= (= — De COs 6) 
C 


Cc Cc 


or, 
Ee Ei oy 
—=Y7 (= + “cost 
C meee 

Because we want an answer in terms of the laboratory angle, we use the 


former of the two. We eliminate p. by using the invariant scalar product 
relationship for the electron, 


pe — (Ee/c*) = —mec’. 
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With p, eliminated, the transformation becomes a quadratic equation in E,, 
for which, after some algebraic simplification, 


€,maxr €,Maxr 
~y (1 — s cos? 0) 


As a final example of this type, let us consider a phenomenon called 
Compton scattering. A photon, originally moving along the x-axis scatters 
from a stationary electron. As a result, the electron recoils at an angle 
@ relative to the x-axis and the photon, with an altered wavelength and 
energy, is scattered to an angle @ relative to the x-axis. We are interested in 
the change in wavelength of the photon as a function of the angle of scatter, 


6. 


Pe ges COS Ee = me (1 — oe cos? 0) 


Eeaens = 


In this instance, nothing is gained by working in the center-of-mass frame 
and all calculations are done directly in the laboratory frame. We have a 
four-momentum for the original photon before the scattering, 


for which gy. P,P. = 0. There is also a total four-momentum before the 
scattering, 


h h 
Phot = Pi. + Ph, = (S00,% + me] 


There is a total four-momentum after the scattering, Pi. + Pru», 


h hw* E- 
Pla = ( = cos ¢, vind — Pe sin @, 0, — +2). 


The scalar product of each with itself is an invariant. Conservation of four- 
momentum tells us that Pf, = Pitot, So that, 


hw \* hw : 
Iw Ph rotP tot = (~) a (~ a mac) = Iu Peto Pa tot: 


We simplify the “after” expression by systematically eliminating four-vectors 
that contain information about the scattered electron for which we are not 
presently concerned. Hence, 


Civ ke sal: tot —_ = Gu (Pe. cm geo Cae Ts Ee sa) 
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= GwPE PY e+ Gav PE wel | nae hte wend oe 


a,e~ a,e a,e~ aw* 


ae —mec + 0+ Ee Cage = Pook, 


==mc + 2G ig P race se —0 
hw hw* h hw 
==—m2c +2 (= “cos — (ee + mec) ( ca ) ; 
é C C 
After some simplifying algebra and using, 
2The 
Pipe 
Ww - 


we obtain the desired Compton scattering result, 


_ Qh 


Ad (1 — cos 6). 


MeC 

There are other kinds of important invariants. The dynamics of an inter- 
action of colliding particles is characterized by a cross-section. Conceptually, 
the cross-section is an area around a target seen by the approaching particle, 
such that if the approaching particle passes within this area, some process 
will take place. For classical Rutherford scattering, alpha particles approach 
gold nuclei and are scattered by Coulomb repulsion. Surrounding the gold 
nucleus like a halo with radius s is a strip of area, do = 27sds in the plane 
containing the gold nucleus but perpendicular to the motion of the alpha 
particle. If the alpha particle passes through this area, it will be scattered 
into an angle between ¢ and ¢+d@. The area da is said to be the differential 
cross-section for this particular process. Similar cross-sections are defined for 
other nuclear processes. 

Since do represents an area perpendicular to the direction of motion of 
the approaching particle, taken to be along the x-axis, the cross-section is 
invariant under the special Lorentz transformation. It provides a simple 
bridge to connect measurements in the laboratory and analysis in the center- 
of-mass system. 

Variants of the differential cross-section are also used. For Rutherford 
scattering, the differential cross-section per unit solid angle is defined, 


do _ msds 8 ds 
dQ xsinddd  singdd’ 
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but this cross-section is not Lorentz invariant because the angle @ is not 
Lorentz invariant. For relativistic purposes it is sometimes convenient to 
note that a four-volume, 


/ 
(dxdydzcdt) = eae dz'(ycdt')) = dx'dy'dz'cdt! 
a4 


is invariant. Since four-vectors have the same transformation properties, it 
follows that, 


EB 
dpodpy dpa 


is an invariant volume in four-momentum space. Thus the cross-section, 


is an invariant. It is a cross-section for a process that produces a particle 
into a certain volume of four-momentum space. 

The proper mass of a system of particles is an invariant with another 
use. Suppose that a proton collides with a proton, producing a shower of 
pions. One suspects that some of the pions result from the decay of a very 
short-lived particle that is also produced in the collision. Let us imagine that 
the short-lived particle is expected to decay into three pions. The detector 
has sufficient resolution to allow one to reckon the relativistic momenta and 
energies of the pions, but not enough to even detect the suspected particle. 

The four-momentum invariant of the suspected particle in its own frame, 
Pi, = (0,0,0, Moc), has a value —M@c?. If one takes the four-momenta of 
the pions, three at a time, forms 


PH = ((pi + pot ps), (21 + Eo + E3)/c) 


and computes g,,P/'P’, and finds that the values cluster around —M¢c’, 
one may take it as compelling evidence for the short-lived existence of the 
suspected particle. 

Thus, invariants are used in a variety of ways to simplify relativistic 
calculations. 
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Computer Exercises 
Ross L. Spencer and Grant W. Mason 


Department of Physics and Astronomy 
Brigham Young University 
Provo, Utah 84602 
In what follows, “ODE” stands for whatever Ordinary Differential Equa- 
tion solver that you have available. 


1. Computer Project: Theory One of the most commonly occurring 
differential equations in physics is the one describing the simple har- 
monic oscillator. In mechanics it occurs when a particle experiences a 
restoring force toward a point of equilibrium that is proportional to the 
displacement away from that point, F = —kx. The differential equation 
arising from Newton’s Second Law becomes, 


Ske = Nf 
or, 
F+uwer=0 , 
with w? = k/m. 


Consider, for example, a spring of negligible mass hanging vertically 
from a support. If you fasten a mass to the spring and gently lower it, 
the spring will extend an additional distance s until the force exerted 
upward by the spring balances the weight of the attached object, 


ks = mq. 


Now lift the spring a distance x in preparation to releasing it and setting 
the system into oscillatory motion. When lifted, the force exerted by 
the spring is k(s — x), so that 


mi = k(s — x) —mg = (ks — mg) — kx = —ka 


since ks = mg. 


If we now let 


L£=v, 
we then have the two first order differential equations, 
C= 
v= —wée, 
which is the form that numerical differential equation solvers use. 


A solution to this set of equations yields x(t) and u(t). You probably 
know already that the equations can be solved analytically and that 


x = Acos(wot — a) 


v = —Auy sin(wot — a). 


A and a are determined by initial conditions. You can verify that 
these are solutions by substituting the solutions into the differential 
equations. 


Since we already know the analytical solutions, let us use the harmonic 
oscillator to illustrate another useful way of looking at motion. Think 
of x and v as being two “coordinates” and make a plot of x versus v. 
Put x along the abscissa and v along the ordinate. Such a plot is called 
a phase plot, each pair of coordinates (x,v) is a phase point, and the 
curve defined by the sequence of points is called a trajectory in phase 
space. 


It is easy to see what the trajectory should be for a harmonic oscillator. 
The energy is conserved, so 


1 1 
E = =mv? + =ka’?. 
rile T 5 x 


Rewrite this in the form 


v? x? 


@E/m) ' RE/R ~” 


which is the form of an ellipse, 


where a and b are the semimajor and semiminor axes. Thus, the tra- 
jectory of the simple harmonic oscillator in phase space is an ellipse. 
Changing the total energy FE changes the semimajor and semiminor 
axes of the ellipse. 


Computer Project 


(a) Analytically find the formula for x(t) that solves this differential 
equation with initial conditions x(0) = 1 and «(0) = 0. Find the 
values of x and & at the time t = 107/wp. 


(b) Let m = 1.0 and k = 2.07 (so that w2 = 2.07) and let £(0) = 0. 
Then let EF = 1.0/47, followed by F = 1.0/87 and FE = 1/16. 
Under these conditions, show that 2(0) = (2E/k)'/?. Plot v(t) 
versus x(t) for several values of FE’ to see if the expected ellipses 
are obtained and to compare them for different choices of EF. In 
this case you don’t want to erase the screen after each computation 
because you want to change something and compare a new graph 
with the old one. You just want to change the initial conditions. 


. Computer Project: Theory If an object falls through a medium 
that exerts a resistive frictional force proportional to some power of 
the speed, we can imagine that its initial motion will be close to free 
fall (since v is still very small), but that the frictional force will grow 
as the speed increases until the frictional force eventually balances the 
weight. At this point the forces are in balance, acceleration ceases 
and the object falls thereafter at a constant speed called the terminal 
velocity. 


Newton’s Second Law for an object of mass m can be written approx- 
imately for a short time interval At as 


Av 
F = Ye 
or, 
FAt = A(mv). 


We can read this to mean that over a short time interval, the change in 
momentum, mv, equals the product of the force times the time interval. 
FAt is called the impulse. 
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Consider a billiard ball falling through the air. The ball has a radius R 
so that as it falls for a short time At it sweeps out a cylinder of cross 
section R? and length vAt. If the density of air is pair, the mass of 
air in the imaginary cylinder is the density of the air times the volume 
of the cylinder, pgi-7R?vAt. To a crude approximation, this air is set 
in motion by collision with the ball and changes its downward velocity 
from zero to the velocity of the ball v. The change of momentum of the 
air equals the impulse, so 


mAv = (Part R?vAt)(v — 0) = FAt. 


Thus, F x v? = bv”. This is the force exerted by the ball on the air. By 
Newton’s Third Law, it is the magnitude of the frictional force exerted 
on the ball by the air. Crudely, then, we conclude that the air might 
exert a force on the ball that is proportional to the square of its speed. 


If this were true, the Newton’s Second Law for the falling billiard ball, 
including gravity, would be, 


mi = mg — bx? 


where we have taken the positive direction to be downward. We can 


write this as ; 
dz b-4 


Separating variables, 


) ane mali =e r fe 


Now, (mg/b) has dimensions of velocity squared, so we will replace it 
with v?. The integral on the left-hand-side is of the form 


1 
[S = ~ tanh71(=). 
por -p p 
So, 
v = vu; tanh(gt/;). 


When t is zero, the hyperbolic tangent vanishes and vp = 0. When t 
gets very large, the hyperbolic tangent approaches one and v > 1%. 
Thus, v;, is the terminal velocity. 
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Here is an only-slightly-crude measured formula for the force on a bil- 
liard ball falling through still air at speed v: 


In |v| 


65 ) 


F = mg —a* vexp (4.4|u| 727" 


In all cases, let a = 1.0 x 107°. The result is a numerical formula in 
SI units, meaning that if v has units of m/sec, then F has units of 
Newtons. The billiard ball has mass m = 0.220 kg. Note that the 
direction of positive displacement x is taken to be down, i.e., gravity is 
a positive force and air resistance is a negative force. 


Computer Project 


(a) Use trial and error on your calculator, or use a numerical solver, 
to find the terminal speed of this ball, i.e., the speed at which the 
force vanishes. 


(b) Use ODE to solve for the motion of the ball if it is dropped from 
rest, i.e., get u(t) and x(t). (You will have to make vp very small 
but not exactly zero to avoid the singularity at In(0).) From the 
plot of v vs t, obtain the terminal velocity. 


(c) Make plots of x and v as functions of time and check to see if they 
are reasonable; describe what you see. Only use a time interval 
long enough to get pretty close to terminal velocity. 


(d) Compare the output from ODE with the prediction of the hyper- 
bolic tangent formula derived above analytically. 


. Computer Project: Theory The differential equation for the one- 
dimensional harmonic oscillator can be modified to include a frictional 
force. For simplicity, assume that the frictional force is proportional 
to the speed of the moving object. With this additional drag force, 
Newton’s Second Law becomes 


mi = —kax — bt 


or, 
# + (b/m)z + wer =0, 


where wi = k/m is the frequency of the undamped oscillator. 
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We can solve this equation by using a trial function, e®’. The parameter 
q@ can be used to make the solution fit our equation. If we substitute 
the trial function into the equation, we get 


(a? + (b/m)a + wee = 0. 


Since e“ does not vanish, the quantity in parentheses must vanish. 

The resulting quadratic equation can be solved for a and will have two 

solutions, 

b b 
+ 4/(——)? — uf. 


Qa=-— 
2m 2m 


If (b/2m)? < wé the quantity under the radical will be negative and 
our two solutions will be complex numbers. Define, 


wy = wo — (b/2m)* = wo(1 — (1/w9)(b/2m)”). 


We can then write our most general solution to the differential equation 
as a linear combination of the two solutions. In doing so, we introduce 
two constants, cy; and C2, that will be determined by the initial condi- 
tions. They will be chosen so that at ¢ = 0, our solution will have the 
correct values for x(0) and «(0). We have, 


a(t) = ere) eee a eee, 


This result can be written in two equivalent forms. First we can use 


a 


et? — coswt tisinwt 
to write our result as 


a(t) = e 4/2 ((e, + cp) coswit + i(c2 — c1) sin wt) 


= ee Gs cos wt + a2 sin wt). 


In this form a, and az are determined by initial conditions. 

Second, we can define B? = a? + a3. Think of this as Pythagoras’ 
Theorem for an imaginary right triangle of which B is the hypotenuse 
and a, and az are adjacent sides such that tan 3 = a2/a,. Then, 


a 
ti) = aoa Cos Wt + > sin w;t) 
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a e !/2™ Bicos Bcoswyt + sin 2 sin w;t) 
= e %#/™ B cos(w it — 8). 
Now B and £ are to be determined by initial conditions. 


In each of the three forms, the oscillation frequency is 


wy = wor 1 — (1/w§)(b/2m)?, 


i.e., the presence of the damping term (parameterized by b) causes a 
shift from the undamped frequency wo. 


We can also use the averaging method to solve our differential equation. 
Think of the differential equation in the form, 


i+wer Sf (oe 


with the right-hand side taken to be a small perturbation on otherwise 
harmonic motion. The averaging method tells us to assume a solution 
of the form, 
x(t) = A(t) cos(wot + o(t)) 
x(t) = —wyA(t) sin(wot + o(t)), 
where A and ¢ are slowly varying functions of time. The method con- 


sists of substituting this assumed solution into the differential equation 
and solving for A and ¢, 


A = Henn LOO) 


o= “mae £) cos(wot + (t)). 


The method then assumes that A and ¢ are varying slowly enough that 
these equations can be averaged on the right-hand-side over one cycle 


of the motion taking A and ¢ to be constant. In our case, f(z,%) = 
—(b/m)zx, so, 


A = —(1/wo)((—(b/m)«) sin(wot + $)) 
§ = —(1/w9)(((—(b/m)#) cos(wot + 4))/A), 
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assuming that A and @ are taken as constants for one cycle of the 
motion. In this case we have, 


A= —(bA/m)(sin?(w, + 4)) 


db = —(b/m)(sin(wot + ¢) cos(wot + @)). 


Since the average of sin*(w» + @) over one cycle is 1/2, and the average 
of the product of a sin and a cos over one cycle is zero, we have, 


A = —(bA/2m) 


o=0. 


The first can be solved by separating variables, 
A= Age (¢" 
and the solution to the second is a constant, ¢o, by inspection. Hence, 


a(t) = Ape~/?" cos(wot + do). 


Computer Project 


To verify the quality of the averaging assumption in the method, con- 
sider the damped harmonic oscillator: 


2 £ 
nh een 
a T 


(a) Use ODE to solve the un-averaged amplitude and phase equations. 
Use w, = 1.0, m = 1.0, x(0) = 1.0, (0) = 0.0 and use several 
values of r = m/b, say T = 1, 3, 10, 100. The unaveraged 
amplitude and phase equations are 


A = —(bA/m) sin? (wot + ) 


@ = —(b/m) sin(wot + ) cos(wot + @). 


Leg 


(b) Describe what the plots of A(t) and ¢(t) look like and verify that 


x(t) = A(t) cos (wot + (t)) 


is indeed very close to the correct solution of the equation for 
7 = 1. Runa solution to the unaveraged phase phase/amplitude 
equations with the given initial conditions. Then add to the plot 
the exact analytical solution, i.e., 


a(t) = e*/?™ B cos(w,t — 8), 
and another for the result of the averaging assumption, 


a(t) = e"/?™ B cos(wot — 8), 


But, remember that for these latter two equations, the values of B 
and @ are determined from initial conditions. The values of B and 
G are not the same in the two equations and must be separately 
determined (analytically) if the comparison of the graphs is to 
make any sense. 


4. Computer Project: Theory What happens if the damped harmonic 
oscillator is driven by some harmonic force of frequency w? The differ- 
ential equation becomes 


mz = —kax — bx + Focoswt 


or, 
+ (b/m)& + wer = (Fo/m) cosuwt. 


Perhaps the slickest way to solve this differential equation is to consider 
a second, companion equation, 


y+ (b/m)y + wey = (Fo/m) sinwt. 


Define the complex number z = x + zy, multiply the second equation 
by 7, add the two together and rewrite as, 


24+ (b/m)z + wez = (Fo/m)e™. 
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If we solve this equation for z, we can reclaim the solution to our 
original equation because x is the real part of z. This method works 
for linear differential equations because the operations d?/dt?, d/dt and 
multiplication by a constant are linear operations. When performed on 
a complex number z, these operations act separately on the real and 
imaginary parts of z, never mixing the two. 


A general solution is a sum of a particular solution and the solution 
to the homogeneous equation. In the previous problem we obtained a 
solution to the homogeneous equation. An obvious candidate for the 
particular solution is something like 


z= ze. 
Substituting this form into the equation to see if it will work, we obtain 
(—w? + i(b/m)w + we) ze! = (Fy/m)e™, 
from which we see that we must require, 


Fo/m 


a (we — w?) + 2(b/m)w 


We can write 2 in polar form, 


; F 20 
% = Re® = —~—___ ee (Fo/m)e 
[(wa — w?)? + (b/m)?w?] 
with, 
—bw 
tan @ = ——_,—... 
en ma = w?) 
If we let a = —0, we can write the general solution to our original 


equation by recovering the real part of z = ze’ and adding it to the 
previously derived solution to the homogeneous equation, 


(f/m) cos(wt — a) 


= ms te B cos(wit — 8), 
[(wo — w*)? + (b/m)?0*) 


where B and ( are determine from initial conditions, 


Wy = wo 1 — (1/w§)(b/2m)?, 
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and, 
bw 


tana = mee 
The second (transient) term dies out exponentially with time leaving 
the first term as the long-term solution. The first term has an amplitude 
that depends on w. It is largest when the denominator is smallest, i.e., 
when w = wg. Thus, the system resonates if driven at or near the 
natural frequency, wp. 


Computer Project 


Explore the properties of the driven-damped harmonic oscillator. The 
differential equation is 


s De 2 Fe 
L+-XL+wex = —coswt 
T m 


where w, is the natural frequency, w? = k/m, and where w is the driving 
frequency. 


(a) Set f/m = 0, x(0) = 2.0, v(0) = 0.0 and use ODE to verify the 
amplitude decay formula 


rx et/2r 


and the damping-modified frequency formula 


Dien gee 
o Ar? 


I recommend using w, = 27 and 7 = 0.2 and running ODE for a 
time interval of 1 or 2 s. 


(b) Leave your model set as in part (a), but change the damping term 
from «/7 to £°/w?r. Run your model with initial values of x = 2.0 
and v = 0.0 for both the model of part (a) and for the model with 
the modified damping term. Use a time interval of 20s. Tell 
how the plots of these two models differ and explain the difference 
qualitatively. (Why does the nonlinear term result in extended 
oscillation instead of exponentially dying away as it does in the 
linear case?) Before continuing with the rest of this problem, 
change the damping term back to the standard «/7 term. 
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(c) Let F,/m = 1, set the damping to zero by making 7 very large, 
and set the driving frequency to w = 0.9w,. Set the initial position 
to x = 0.5 and set the initial velocity to zero. Run ODE for 40 s, 
or so, and plot 2(t). Explain the pattern you see in the plot. (Do 
you know what “beats” are?) Now set 7 = 1 and run ODE for 
20 s. Explain the new pattern displayed in the plot of x(t). 


(d) Scan the driving frequency, w, from w = 0.lw, to w = 10w, and 
use ODE to verify the expected resonance. Run ODE with 7 = 1, 
use a time interval of 20 s, and initially set « = 0 and v = 0. 
It may be easier to get the steady-state amplitude from a data 
file than from the plots. Sketch your results for amplitude as a 
function of w. 


(e) To become more familiar with phase space, make phase space 
plots of the orbits, x versus v. I suggest using a driving frequency 
of about w = 0.95wp. Plot at least five phase space plots on the 
same graph for comparison. Explain why all of them end up doing 
the same thing in spite of their different starting points. We say 
that the motions all approach the same limit cycle. 


5. Computer Project: Theory What happens if the driven, damped 
oscillatory motion occurs with a restoring force more general than F), = 
—kax? Assume that the restoring force is still derivable from a potential, 
F, = —dV/dzx that has either a minimum or a maximum Vo at Zo. 


If we make a Taylor’s expansion around the minimum or maximum, we 
can write, 


V =VW + (dV/dz),,.(" — %o) + 5 (EV /da?)au(2 = 99)? +: 


Since we have chosen to expand around a maximum or minimum, 
(dV /dx),, = 0. If we move our axes so that zo = 0, and let constants 
stand for the derivatives evaluated at the point of expansion, 


i! 1 1 
V=Vot ska® + ahaa” + sTkst! +... 


If the potential is symmetric (V(x) = V(—a)), the odd powers of x 
must vanish because their coefficients vanish. If we assume that the 
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terms in the series are getting smaller as one goes to higher order and 
if we then truncate the series with the quartic term, we have 


dV meee 


We will take k > 0 as it would be for a harmonic oscillator. Depending 
on k3, the cubic term might have a positive or a negative coefficient. 
If the overall coefficient is positive, the potential looks like a rounded 
letter “M” centered on the origin. (Or think of a top view of a nudist 
bending over to pick up his collar button!) It is this case that we want 
to investigate, so we will write, 


F=—kzr + 2°, 


with both k and y positive. Motion will no longer be strictly harmonic, 
but will instead be referred to as anharmonic. 


For our driven, damped anharmonic oscillator, the differential equation 
becomes, 
mi = —kx + yx? — bt + Fy coswt, 


) # + (b/m)z + wer — (y/m)z? = (Fo/m) coswt. 


The equation is now nonlinear, but we might expect that motion of suf- 
ficiently modest energy to be somewhat like the motion for a damped, 
driven harmonic oscillator, i.e, after some time the motion settles into 
periodic motion with the frequency of the driving force, w. We might 
also expect that the amplitude of the motion will get large (resonance) 
if the driving frequence w is close to the natural frequency, wy. 


If the motion is periodic, we should be able to represent it with a 
Fourier series expansion, 


z(t) = a9 + S_(an cosnwt + by sin nwt), 


&(t) = So (—nwa, sin nwt + nwb, cos nwt). 


If we set our origin of time so that « = 0 at t = 0, all of the coefficients 
b, of the cos terms in the expression for must vanish. Furthermore, 
Fourier analysis tells us that 


But this is just the average position of the particle and for a symmetric 
potential such as we have assumed, the average position is expected to 
be zero. So ag = 0. 


The final argument will take some sketches of cos 2wt and an imagined 
periodic x(t). If the motion were truly periodic with frequency w and if 
we were to start it with initial conditions that x7(0) = x9 and «(0) = 0 
in a symmetric potential, then when wt = a, the particle will have 
moved to —2%o and will then return to x by the time wt = 27 as if 
time had simply reversed. (Damping would invalidate this assumption 
since damping is not time symmetric. The test for time symmetry is 
whether changing t and dt to —t and —dt leaves the equation of motion 
unchanged. For the moment we now limit our discussion to undamped 
motion.) The consequence of time symmetric motion is that, 


1 2a 
a2 = - | x(t) cos 2wtd(wt) = 0. 
T Jo 


A sketch of the product of x(t) and cos 2wt shows that for every positive 
piece of the integral, there is a corresponding negative piece that cancels 
it. The argument can be extended to show that a, = 0 for all even 
values of n. 


Thus, our periodic solution for x(t) must look something like, 
x(t) = a, coswt + a3cos3wt+... 


if we choose #(0) = 0 in a symmetric potential well. If the motion is 
close to harmonic, the series will converge rapidly and we can keep just 
the first two terms to get an approximation. We can also assume for 
rapid convergence that a3 << ay. 


In the absence of damping and the absence of a driving force, 
# + wer — (y/m)x? = 0. 


We can substitute our trial solution into the differential equation, 
gather the coefficients of coswt and cos3wt, separately set them to 
zero and solve for w and a3 in terms of a;. The troublesome part of this 
agenda is the handling of 


x? = (a, coswt + a3 cos 3wt)?. 
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Terms involving cos? wt and cos* wt have to be reduced using, 


4 1 3 
cos’ wt = i cos 3wt + ri cos wt 


1 1 1 
cos? wt cos 3wt = ri cos 5wt + 5 cos 38wt + A cos wt. 


Terms with cos 5wt that originate in this process are discarded since 
we have already thrown away higher harmonics in our trial solution. 
Further, since we have assumed that a3 << a, and have thrown away 
all other terms, we are not justified in keeping some other small terms 
that arise in the products obtained by expanding x°. The coefficient a, 
is the largest and is said to be first order small. The coefficient a3 is 
said to be third order small. A term like a? is also third order small, 
aja is fifth order small. Since we threw away everything smaller than 
third order small in our trial solution, we are not justified in keeping 
anything smaller that may appear in the process of expanding 2°. 


We then have, 


[a (we — w) + “(/m)al cos wt 


1 
+ [a3 (we — 9w?) + q6r/m)ay| cos 3wt = 0. 
Since cos wt and cos 3wt are linearly independent, their coefficients must 
separately equal zero if the equation is to be satisfied. 


We use these two equations to solve for w and a3 in terms of a, and the 
other parameters of the problem. We chose the initial condition « = 0 
already, leaving the initial condition on x to determine aj. 


We obtain, 


wah + (/m)ah, 
and, 
(y/m)ay _ (y/m)aj 


“8 3208 + Ba(y/m)ai]——32u3 


But what happens if the damping and driving forces are added? The 
result is both remarkable and surprising. 


Computer Project 
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Consider the driven, damped, anharmonic oscillator 


#=—2+42° — at + f coswt 


where for numerical convenience w, has been chosen to be 1.0 and where 
the size of the nonlinear term has been arbitrarily chosen as well ( 42°). 


(a) With a = 0 and f = 0, find the equilibrium points for this sys- 
tem(points of vanishing force) and tell whether each is stable or 
unstable. Make a phase space sketch of the orbits for this system, 
then make phase space plots for several different initial conditions 
to verify your sketch. When you run the model, I suggest that you 
just choose 7, = —0.4, vo = 0 and a time interval of about 4 s. 
Set the maz and min values of the x-axes to (-0.6, 0.6) and of the 
v-axes to (-0.5, 0.5). You should be able to generate a wide vari- 
ety of interesting orbits. Be careful not to select starting points 
far into the unstable regions or the model will achieve numerical 
overflow and possibly throw you out of ODE. You will then have 
to start over. 


(b) Add damping by setting a = 1.5 and make the same kind of 
phase-flow picture you made in part (a). Describe how the flow 
has changed near the stable and unstable equilibrium points. 


(c) Let a = 0.4, f = 0.115, x = 0, & = 0, and run the model for 
a time interval of 100 s for the following choices of the driving 
frequency, w: 


w= 02 Oa Ay 05, OOS OT, De 8 


Use the results of your computer runs to sketch the amplitude as 
a function of w?. Do you see evidence for resonance? 


6. Computer Project: Theory This project is a continuation of the 
previous one. There is an interesting effect to be seen here, but it 
takes a bit of perserverance. The theory for this project is given in 
the book Newtonian Mechanics, Ralph Baierlein (McGraw Hill, 1993, 
pages 82-87). 


Computer Project 


185 


Consider the driven, damped, anharmonic oscillator 
#=—2+42° -— at + f coswt 


where for numerical convenience w, has been chosen to be 1 and where 
the size of the nonlinear term has been arbitrarily chosen as well ( 47°). 
Let a= 04, 7 =0:115, 75 = 0; 45 =0. 


(a) Try to scan the frequency more carefully past the region near 
w = 0.60 — 0.70. It is in this vicinity that the amplitude has unex- 
pected behavior. If you slowly approach this frequency range from 
below, the amplitude takes a sudden jump, but if you approach 
from above, the jump occurs at a different place! This is not a 
computer glitch, it is the way the system actually behaves. We 
have spent a lot of time trying to make this work, and have only 
been successful by making very long runs. The key is apparently 
to scan the frequency in a special way. If you just do the obvi- 
ous thing and let the frequency change very slowly with time to 
move along the w axis, the time-dependent frequency completely 
changes the dynamics and nothing resembling the figure emerges. 
The apparent right thing to do is to move the frequency up in 
steps, with enough time between the steps to allow the oscillator 
to settle into its limit cycle. This is done by using the ceil (x) func- 
tion (i.e., the integer just greater than the argument) in ODE in 
the following way: 


(wp — Ws) 


a * ceil(.0007958 « t — 0.5) 


W = Ws. + 
where w, is the starting value of the frequency and wy is the final 
value of the frequency. This formula is built to make 20 frequency 
steps between w, and wy if the time interval you have chosen is 
25132.0 s. Note: you will have to type this whole rotten thing 
into the argument of the cosine function, then multiply it by t to 
get cos (wt), so we suggest first writing the coded equation out 
carefully on paper. Check your parentheses carefully! This is a 
long run. We found that a time step of 0.3 s does a reasonable 
job. Make two scans, one with w, = 0.6 and wy = 0.8, and the 
other with w, = 0.8 and wr = 0.6. Try to use a plot of x vs ¢ to 
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determine the time at which the system jumped from one branch 
to another and see if those times were the same or different for 
the two scans. Explain what you find using the term hysteresis. 
It will help you to understand when the jump occurs if you make 
the frequency, given in the formula above, a variable and plot it 
together with x. Then you can see at what frequency the jump 
occurred. (Do not plot the entire x vs t curve. It would take 
forever! Use the computer screen to observe where the transition 
occurs and read the data from the screen or a data table and make 
a sketch for your report.) 


7. Computer Project: Theory The Lagrangian for a simple pendulum 
of mass m suspended by a massless string of length @ is 


1 
Le ym — mgl(1 — cos@). 
The differential equation of motion that follows from this Lagrangian 
is, 
6 + (g/€) sin@ = 0. 


If the angle @ is small, sin@ & 6, and the equation becomes that of a 
simple harmonic oscillator with uw? = g/€. 


If we add a damping force proportional to speed, F; = —b6, we have 
6 + we sin é + (b/m)6 = 0. 
If we add a harmonic driving force, 
6+ w? sind + (b/m)6 = (Fy/m) coswt. 


For certain choices of parameters, including the driving frequency, this 
supposedly simple system exhibits a complex and erratic behavior that 
is called chaos. 


Computer Project 


Study the pendulum with damping and a sinusoidal driving force. Use 
the equation of motion 


§ = —sind — ad + f cos (wt) 


which has a small amplitude oscillation frequency of w, = 1. 
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(a) Study the motion of the pendulum with no damping and no drive 
(a = 0 and f = 0) by using several different initial conditions 
between 6, = 0.1 radians and 6, = 3.13 radians (0, = 0). Explain 
why your plots of @ as a function of t make physical sense. 


(b) Add damping by setting a= 0.2 (keep f = 0) and use an initial 
angle of 4, = 3 and two different initial angular velocities, 6, = 2.2 
and 6, = 2.3. Use a time interval of 100 s. Explain why the phase 
space plots for these two cases are different by explaining what 
the pendulum is doing in each case. 


(c) If the system is driven with the driving frequency set to w = 
0.3 and the drive amplitude set to f = 1, you should see the 
system settle into a limit cycle (use initial conditions 0, = 0 and 
6, = 0 and keep a = 0.2). As f is increased you should see a 
transition to chaos somewhere between f = 1 and f = 1.1. Study 
this transition in as much detail as possible. (You may have to 
select a higher-than usual precision solution option to get as much 
accuracy as possible — the transition to chaos is a very delicate 
business.) The best way to observe chaos is to look at a phase 
space plot. Select an initial point, start it near (0,0), and just let 
it run for a long time (>30 seconds) to see if it settles into a limit 
cycle or if it just bounces around. 


(d) One of the hallmarks of chaos is that two points started close to 
each other in phase space very quickly become widely separated. 
Observe this effect by using f = 1.1 and starting two points close 
to each other and plotting their orbits on the same graph. 


8. Computer Project: Theory The following treatment illustrates a 
general method for solving vibration problems in a linear approxima- 
tion. 


A carbon dioxide molecule is a linear structure with a carbon atom in 
the center and an oxygen atom on each end. We can model the bonds 
that attach the oxygen atoms to the carbon as linear springs of constant 
k. For simplicity let us assume the masses of the three atoms are equal 
to m. When stretched and released, the system begins to vibrate along 
the line joining the atoms and it is this motion that we seek to describe. 


If we assign Cartesian coordinates x1, %2, 73 relative to some arbitrary 
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origin, the kinetic energy of the system is given by, 


. de ek ie 
f= sty + sinats + simaty. 
We can define a mass matriz M with the masses distributed along the 
diagonal elements and zeros elsewhere. If the masses are not equal as 
in this example, M;; = m,0,;, where 0,; is the Kronecker delta. Then 
we can write the kinetic energy as, 


1; ; 
T = 5 ti Misty, 
where, 
My 0 0 
M= 0 m2 O 
0 0 m3 


The potential energy lies in the stretch of the two springs, 


1 
V = gtikiyey 
if we define a matrix 
k —-k O 
K=|—-k 2k —-k 
0 —-k &k 
The Lagrangian is, 
1. ; 1 
L=T-Ve= 3 ti Mish; = gripes, 


from which the differential equations of motion follow, 


Ma; + Ka = 0. 
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(The factor of 1/2 disappears because of the double sum in the La- 
grangian. If you have difficulty seeing it, you should write out the 
sum.) 


We can write the equations of motion in matrix form, 
MX+KX =0, 


where X is a column vector, 
X= oD) 


A trial solution to this set of linear differential equations is, X = 
Xo cos(wt — a). When substituted into the equations of motion we get 
the matrix equation, 


(—w?M + K)Xy = 0 


This set of equations has a solution if and only if the determinant of 
the coefficients vanishes, i.e., 


|K —w?M| =0. 


In our particular case this becomes, 


k — mw? —k 0 
—k 2k — mow? —k = 0. 
0 —k k — maw? 


We evaluate this equation and solve for w?. The resulting values of 
w are the only values for which the trial solution is a solution to the 
differential equations. In our example, (if we set m1; = mz = m3 =m 
for simplicity) there are three values of w?, 


we =0 
wi =k/m 
ts = 3h) mM, 
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and for each there is a corresponding Xo which arises from solving 


(K —w?M)Xo = 0, 


1 1 1 
Xn =| 1] Xo=] 0 Xo3 = | —2 
1 —1 1 


The general solution to our differential equations is a linear combination 
of these three, 


Ly 1 
x2 | =a, | 1 | cos(wit — a4) 
X3 1 
1 1 
+a2| 0 | cos(wot —az)+a3} —2 | cos(w3t — az). 
—1 1 


The w; are called the eigenfrequencies and the Xo; are the eigenvectors. 
The constants a; and a; are determined by initial conditions. In general 
the motion is a complicated combination of oscillations all happening 
at once. But, if you choose the initial conditions just right, you can get 
the system to pick out one of these frequencies to the exclusion of the 
others. These special motions are normal modes of the system. 


Computer Project 


Set up a ODE model for this system. 


(a) Solve the differential equations of motion for several arbitrary 
choices of initial conditions. Plot a(t), vo(t) and x3(t). Let 
my M2 = M3 1 and k = 1. Run the model long enough 
so that you can see the frequencies of the oscillations. 

(b) Solve the equations for the initial conditions X(0) = 0 and 
Xo(0) = Xo2 (the second eigenvector.) Again, plot x;(t) and ob- 
serve the frequencies of oscillation. Describe the motion of the 
system qualitatively in your own words. 


(c) Solve the equations for the initial conditions X(0) = 0 and 
Xo(0) = Xo3 (the third eigenvector.) Again, plot x;(t) and ob- 
serve the frequencies of oscillation. Describe the motion of the 
system qualitatively in your own words. 
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(d) 
(e) 


One of the eigenfrequencies is zero. What does the system do if 
you start it from rest with this eigenvector of positions? 


What differences in parts (a)-(c) do you see if the mass of carbon 
is taken to be 1.0 and the mass of oxygen is taken to be 1.333 
to more realistically simulate a carbon dioxide molecule? Use the 
same initial conditions that you originally used in parts (b)-(c) 
even though these are no longer the exact eigenvectors. 


9. Computer Project: Use ODE to numerically solve for the motion of 
two gravitating masses in the x-y plane. This will require some care 
because the phase space for this problem is 8-dimensional. To make 
things simple, assume that the gravitational potential energy between 
the two masses is given by 


(a) 


(b) 


myzm 
U(r1,r2) = et 


Ir, — r2| 
Write down the eight first-order equations of motion for 
(x1, Yi, Vari, Vy1, V2, Y2, Vx2; Vy2)- 


Make a ODE model that correctly codes your equations of motion. 
This will be difficult and tedious because you will have to put the 
denominator 
[(v1 — 2) * (%1 — La) + (Yr — Yo) * (Y — y) >? 

in each of the four equations for the time derivatives of the ve- 
locities. Use masses m, = 1 and mz = 1.5 and initial conditions 
t= 1, Y= 0, Un) = 0.1, Vyl = 0.7, v2 = —1, Y2 = 0, U72 = 0.12, 
Uyg = —0.55. Make a plot of x; vs. y, ie., look at the orbit of 
mass | in the x-y plane. Explain qualitatively why it looks the way 
it does. For high speed and reasonable accuracy, use At = 0.05 
and run for a time interval of 60 s. 


Define a variable containing the total energy of the system and 
verify that it is constant. 


Define variables containing the x and y components of the position 
of the center of mass. Plot en VS. Yem on a plot — does it look 
like it should? 
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(e) 


Define variables containing the x and y components of the differ- 
ence vector between the two masses, r,—Yr2. Plot the x component 
of this vector vs the y component and verify that the difference 
vector does indeed trace out an ellipse. (Note: ellipses may look 
circular unless the axes on the plot are perfectly chosen, so unless 
you change the axis ranges, you will probably see more or less 
circular orbits. But they are really ellipses. 


10. Computer Project: In the previous exercise we used all 8 variables 
for the motion of two attracting particles in the plane to numerically 
solve the orbit equations. In this exercise we will go to the center of 
mass frame and use conservation of angular momentum to reduce the 
orbit problem to three variables, (r, v,, 0). 


(a) 


Use the law of angular momentum conservation and the radial 
equation of motion to write down three first-order differential 
equations for (r, v,, 0). The purpose of this part is to find or- 
dinary differential equations for ODE to solve. 


The purpose of this part is to find initial conditions for circular 
motion. (See homework Problem 8.2.) Set G= 1, my = m2 = 1, 
and ¢ = mr26 = 1. Under these conditions, solve for the radius 
and angular velocity of a circular orbit. Verify your solution by 
numerically solving the three equations of motion with ODE. Note 
that to get an interesting orbit to plot, you will need to convert 
r and @ into x and y in the usual way by defining them as new 
variables in ODE: 


P= COs. . y =rsin@ 


Make a phase plot with x and y as the two variables. 


The purpose of this part is to find a v,9 to change the orbit from 
the circle of Part (b) to an ellipse. Changing v, does not change 
the angular momentum since it is a change of velocity along the 
line connecting the two particles. Perturb your circular orbit by 
adding just enough radial velocity to make an elliptical orbit with 
the same angular momentum as the circular orbit, but with ec- 
centricity e = 0.5. Verify your solution with ODE. 
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(d) The purpose of this part is to test Kepler’s Third Law by finding 
initial conditions and solving for three ellipses of the same ¢ but 
different perigees. Plot r as a function of time for @ = 0.65 and 
read the period of the motion from the graph. We suggest running 
for about 50 seconds. Estimate the length of the major axis of the 
ellipse from a parametric plot of y versus x. Try three different 
values of r(0) and roughly verify Kepler’s Third Law that relates 
the period of the orbit to the major axis. 


(e) Now modify the inverse square law force term in the radial equa- 
tion of motion and describe what happens to the orbit. This 
will require changing the ordinary differential equations them- 
selves. Some possible suggestions are to replace Gm m2/r? by 
Gm m2/r?**, where 6 is a small number, or to add a small term 
to the force law, e.g., F = Gm mo(1/r? + 6/r*). 


11. Computer Project: You are playing championship basketball on a 
rotating floor. The game is tied with time running out, and you must 
sink a 13.1-foot (4-meter) jump shot to win the game. You are on the 
x-axis, one meter away from floor’s center of rotation, which is located 
directly behind you. The basket is also on the x-axis, 5 meters away 
from the center of rotation. The floor is rotating in the counterclockwise 
direction, viewed from above, at w = 0.4 s~'. You arbitrarily choose 
the vertical speed of your shot to be 7 m/s, giving a total elapsed 
time between the release of the ball and its arrival at the vertical level 
of the basket of 1.2676 s. The problem then is to choose a radial 
velocity and an angular velocity so that when the ball comes down 
through the vertical level of the basket, it is also at r = 5 m and 
§ = 0. A mere mortal would panic, but you have been selected to the 
GCAA Academic All-Galaxy team, so as you dribble between frantic 
defenders, you calmly imagine a blackboard in your mind on which 
you mentally write down the rotating-frame-Lagrangian in cylindrical 
coordinates (r, 0). You correctly deduce that gravity plays no role on 
these coordinates since you have already taken it into account with your 
choice of a vertical velocity of 7 m/s and your calculation of the hang 
time of the ball (1.2676 s). Deriving the (r, 0) equations of motion of 
the ball you discover that there is a conserved angular momentum, and 
you use it to write down three first order differential equations for (r, 7, 
and @). You quickly call up ODE from your upper left frontal lobe and 
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code the three equations. The initial conditions for r and @ are easy: 
r =1 and @=0. But you also must choose the correct initial values 
of 7 and @ so that at time t = 1.2676 s, the ball will arrive at r= 5 m 
and 6 = 0. The equations are nonlinear, so even you can’t solve them 
analytically in the short time remaining in the game, but with your 
new coprocessor implant, you are able to run ODE repeatedly to find, 
by trial and error, the correct initial values of 7 and 6 and win the 
game. What are your winning values? 


This way of using a differential equation solver is called shooting, and 
it is used all the time to solve differential equations where boundary 
conditions are specified instead of initial conditions. 


(Note: it is also possible to solve this problem by getting out of the 
rotating frame, using Cartesian coordinates, and shooting at the mov- 
ing basket. Then you transform back to the rotating frame to get the 
correct values. If you do it this way, you will foul out of the game and 
not win the most valuable sentient life-form trophy. You will also not 
get any points for this problem.) 


Computer Project: Use ODE to solve Euler’s equations for a spin- 
ning object with principal moments of inertia J; = 1, Ig = 2, and 
Tz — 3. 


(a) Show that if the initial conditions are either w, = 1 and the other 
two w’s are small, say 0.05, or w; = 1 and the other two are small, 
then the motion is stable (the other two w’s stay small). Also 
show that if w2 = 1 and the other two are small, then the motion 
is unstable, i.e., the other two w’s don’t stay small. 


(b) Now fasten a rubber band around a book to keep it closed and 
throw it into the air while spinning it about each of its princi- 
pal axes in succession. Tell how what you see makes it possible 
to tell which axis is the unstable axis, and check to see if your 
observations agree with the results you just got from ODE. 


(c) Set w; and w3 to be small and set wy = 1.0. Try to decide whether 
these nonlinear equations have chaotic solutions or regular solu- 
tions. Change the initial w;’s enough times that you are reasonably 
convinced that you know the answer to this question. Do you see 
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“sensitive dependence on initial conditions?” To be unstable is 
not the same as being chaotic. Which do you have? 


Finally, set all three w; = 1.0 and make phase space plots of the 
motion with all three possible pairings of the three w’s. This 
should convince you that classical mechanics, though very nearly 
ancient, is still rich and complex. 
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Appendix A 


What to do before you start 


While physicists spend a lot of time solving mathematical problems, they 
often spend as much time trying to convince themselves that the answers 
they have written down are correct. There are no “answers at the end of the 
book” in research. Physicists who are really good are constantly applying 
tests to their work to convince themselves that what they are doing is correct. 
What they do is a kind of craftsmanship. Like any craftsmanship, it requires 
practice and conscious attention and effort. The following is an attempt to 
list some things that one can do if one is asked to “convince the reader that 
what you have done is correct.” 


A.1 What to do before you start writing 
down equations. 


Guess. Write it down. John Wheeler is said to have warned, “Never begin 
to calculate until you know what the answer is.” A good physicist tries 
to develop intuition about physical systems, an intuition that tells him/her 
what to expect before beginning to calculate. Your guess does not have to be 
precise or even right. If you are seeking a numerical answer, you may only be 
looking for an order-of-magnitude estimate. But you can’t lose by guessing. 
If your guess is right, your calculation will confirm your good sense and you 
will grow in confidence. If your guess is wrong, you will build your intuition 
by correcting your misconceptions by the calculation. Here are some things 
to help with your “guess”: 


LOT 


Draw a diagram, sketch, or rough diagram. Label it with things you 
think are important. 


Ask yourself what the answer could depend upon. For example, what 
could the frequency of a simple pendulum depend upon? Mass? Length 
of string? Phase of the moon? Color of the bob? Temperature? Den- 
sity of air? Size of the bob? Gravitational constant? 


From the above list choose the appropriate parameters and variables 
that you think govern the answer you are seeking. 


Apply dimensional analysis. Is there a combination of the fundamental 
parameters of the system that have the dimensions of the answer you 
seek? Are their “natural” time scales, distance scales, frequencies, etc. 
that present themselves from the parameters of the problem. For exam- 
ple, if, for the simple pendulum, you identify the length of the string @, 
the mass m, and the gravitational constant g as things on which you be- 
lieve the frequency will depend, you may note that @ has dimensions of 
[length], g has the dimensions of acceleration, i.e. [length]/{time]? and 
that, therefore, Jg/t has the dimensions of [time]~*, i.e. the dimen- 
sions of frequency. It is almost certain, therefore, that the frequency of 
the pendulum is set by this “natural frequency.” 


To be more systematic, reduce the dimensions (not the units) of the 
parameters of your system to products of the fundamental dimensions 
of distance, time, mass, electric charge, i.e. to the dimensions of the 
quantities established by standards. This is what we did when we 
established the dimensions of g to be [length|[time]~*. To find what 
combination of m, g, and £ have dimensions of [time]~!, write, 


[ml%g"| = [time]!. 


The square brackets | ] are to be interpreted to mean “dimensions of the 
things inside.”. In this example, a, 3, and y are unknown exponents 
that are to be chosen to make the dimensions on the left-hand-side 
equal to the dimensions of the right-hand-side. Since the dimensions of 
mass are established by a standard, the dimensions of mass can never 
be expressed as combinations of [length] or [time]. We then have, 


[mass]*[length]* [length] [time]~*7 = [time]~* 
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Thus, since the dimensions of mass do not appear on the right-hand- 
side, a must be zero. Similarly, to cancel the dimensions of [length], 
we must have that 6 = —y. And, finally, to get the power on [time] 
correct, we must have y = 1/2. 


e Make an order-of-magnitude estimate. In general, the answer you seek 
to a problem may (but not always) be the one given by dimensional 
analysis but multiplied by some dimensionless number, usually in the 
range 0.1-10. 


e Reason from symmetry. 
e Reason from conservation laws. 


e Solve a simpler problem first. Ignore air friction, the variation of g with 
altitude, the mass of a string, etc. Solving the simpler problem may 
give you a plan for solving the more complicated problem and it gives 
a limiting case to which the solution to the more complicated problem 
must reduce in the limit that certain parameters are allowed to go to 
zero (or 1 or infinity, etc). 


A.2 What to do along the way 


e Use vectors and vector notation to describe the problem economically. 
e Choose coordinates wisely 

1. Choose coordinates that fit the symmetry or the preferred direc- 
tions of the problem. 


2. If necessary, move to coordinates in which the description is sim- 
pler, such as coordinates fixed to a rotating frame or to the center 
of mass of the system. 


e Use constants of the motion such as energy, momentum, angular mo- 
mentum, etc. Sometimes you can replace combinations of variables 
with a single constant. 


e Check your units. Try to find the natural scales of the problem, i.e. the 
combinations of parameters with dimensions of length, time, frequency, 
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velocity, etc. and write variables as dimensionless ratios such as x/Zo, 
t/to, w/wo, v/vo, etc. wherever possible. Remember, the arguments of 
such functions as sin, exp, log should be dimensionless. If you express 
things as dimensionless ratios, you will be able to see that this condition 
is being met. 


If the problem has more than one natural time scale, i.e. something 
about the problem that changes very quickly superimposed on some- 
thing about the problem that is changing much more gradually, you 
may want to separate the problem into two separate parts, solve them 
separately, and combine the results. Sometimes the fast motion can be 
averaged and the average behavior used in solving the slow problem. 


If you do complicated algebraic manipulations on an expression and 
wonder if you have made an algebra mistake, you can sometimes check 
your work with a computer or calculator by substituting randomly 
chosen values of variables separately into the right-hand and left-hand- 
side of the expression and seeing if you get the same number in both 
cases. 


If you reach an impasse in your ability to solve the problem, try: 


1. Dropping small terms. 

2. Ignoring slow variations. 
3. Linearizing the equations. 
4 


. Using a series expansion and keeping leading terms (Taylor, bino- 
mial, etc.). 


Or 


. Iterating. 


6. Guessing a form for the answer with adjustable parameters that 
can be used to fit it to your problem. 


Is your answer reasonable? 
Does your answer agree with your original guess? Why not? 
Does your answer give the correct result for certain simple limits? 


Can you compare an analytic result with a numerical computation for 
some choice of parameters to see if they are the same? 
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e Does your answer agree with one obtained independently by someone 


else? 
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